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PRODUCTION OP INSULIN-LIKE GROWTH 
FACTOR-1 IN XETHYLOTROPHTC YEAST CELLS 

RELATED APPLICATIONS 

5 This application is a continuation-in-part of U.S. 

Application Serial No. 07/578,728, to Brier ley et aj^, 
filed September 4, 1990, "PRODUCTION OP INSULIN-LIKE ' 
GROWTH FACTOR-l IN METHYLOTROPHIC YEAST CELLS". The 
subject matter of this application is also related to 

10 U.S. Application Serial No. 07/678,916, to Gleeson e± 
Sl^, filed April 1, 1991, "GENES WHICH INFLUENCE PICHIA 
PASTOR! g PROTEOLYTIC ACTIVITY, AND USES THEREFOR". U.S 
Application Serial Nos. 07/578,728 and 07/678,916 are 
incorporated herein in their entirety by reference 

15 thereto. 

FIELD .OF THE INVENTION 

The invention described herein relates to processes 
for producing authentically folded biologically active 
insulin-like growth factor-l (IGF-I) using aethylotrophic 
yeast host cells, such as Eichia pastor^ (EjL sss ^ 3 r i&) . 
The invention further relates to the methylotrophic yeast 
that produce IGF-I, to DNA fragments for preparing 
expression vectors that include DNA encoding IGF-I and 
that are used for transforming the methylotrophic yeast 
cells, to the expression vectors, and to cultures 
containing the transformed yeast cells. 
BACKGROUND OF THE INVENTION 

Insulin-like growth factor-l belongs to a 
heterogeneous family of peptides which share some of the 
biological and chemical properties of insulin, but which 
are antigenically distinct from insulin. Currently 
available experimental evidence suggests that IGF-i 
promotes growth by mediating the effects of growth 
hormone. Thus, such processes as skeletal growth, cell 
replicati n and oth r growth related pr cesses are 
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affected by IGF-1 levels. Physiological concentrations 
of IGF-1 have been shown to be influenced by such 
conditions as thyroid disease, diabetes and malnutrition 
(see Preece (1983) Horm. Blood , 4: 108). IGF-1 has also 
5 been shown to act synergistically with other growth 

factors, for example, in accelerating the healing of soft 
and mesenchymal tissue wounds (see Lynch et al. (1989) j. 
Clin. Periodontol - . l£: 545; and Lynch £fc &Li_ (1987) 
Froc. Natl. Acad- Sci. hsa. 84: 7696) , and in enhancing 

10 the growth of mammalian cells .in serum-free tissue 
culture medium (see Burleigh efc al. (1986) American 
Biotech- Lab. r 4: 48). 

Considering the many clinical and research 
applications of IGF-1, a ready supply of IGF-1 would be 

15 of great value to the medical and biotechnology fields. 
Since isolation from natural sources is technically 
difficult, expensive, and time consuming, recent efforts 
have centered on the development of efficient recombinant 
methods for the production of IGF-l. 

20 Among host cells that have been used for the 

production of heterologous proteins, J2±. coli and 
Saccharomvces cerevieiaA (Baker's yeast) are probably the 
best characterized. Insulin-like growth factor-1 (IGF- 
1) , which is a polypeptide of 70 amino acids with a 

25 molecular weight of 7648 daltons, is a single chain 
protein that has three intrachain disulfide bridges. 
These disulfide bonds, along with numerous hydrogen bonds 
and hydrophilic interactions, maintain the compact 
tertiary structure of this molecule. coli . however, 

30 does not possess the ability to produce disulfide bonds 
in proteins, so that proteins, such as IGF-1, that 
include disulfide bonds, when cloned into and expressed 
in 1^ coll. frequently are not stable and tend to 
aggregate into inactive complexes. In addition, IGF-l 

35 produced in coli has to be xtracted and treated with 
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oxidizing agents to produce the disulfide bonds. Upon 
reduction and reoxidation, IGF-i refolds in a variety of 
ways, forming as many as 15 monomer ic configurations 
(Hfing_fi ai,. (1988) J. Chrom. . ±42: 183) because cell 
5 breakage and too rapid formation of disulfide linkages 
results in random disulfide bond formation. In order to 
produce biologically active IGF-l, the resulting mixture 
of 15 different forms of IGF-l must be separated. 
Consequently, the yield of purified product is very low 
10 (Grossgian (1985) Gene , jjfc: 199). 

Furthermore, since coli is a prokaryote, in order 
to produce IGF-l molecules which contain the authentic N- 
terminal glycine, and not the initiating methionine 
present on the primary translation product, it is 
15 necessary to express IGF-l in JL. coli as a fusion 

protein. Cleavage of mature IGF-l from the initially 
produced fusion protein necessitates an additional step 
in the production process. Consequently, attempts to 
produce this peptide by recombinant means in £. coli host 
20 expression systems results in a complex mixture of 
product forms which must be separated for further use 
(see, Grossgian (1985) Gene f 1&: 199). 

Bukaryotic host cells, including yeast cells, thus, 
are the host cells of choice for the expression of many 
!5 eukaryotic proteins. Yeast host cells offer clear 
advantages over bacteria in the production of 
heterologous proteins, including their ability to 
properly process pre-pro-heterologous proteins and 
secrete heterologous proteins into the culture medium. 
0 Secretion of proteins from cells is often superior to 
production of proteins in the cytoplasm because secreted 
products are obtained in a higher degree of initial 
purity and further purification of the secreted products 
is made easier by the absence of cellular debris, in 
5 addition, the secretory pathway of the cell and the 
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extracellular medium tend to be oxidizing environments 
which support disulfide bond formation necessary for 
proper folding of many proteins (Smith, et aj^ (1985) 
Science 223.1 1219) ; whereas, the cytoplasm is a reducing 
environment in which disulfide bonds do not form. Thus, 
for production of sulfhydryl-rich proteins that rely on ' 
disulfide bonds to maintain the correct tertiary 
structures, there is a compelling need to develop 
eukaryotic hosts capable of secreting such proteins into 
the culture medium. Therefore, production of sulfhydryl- 
rich proteins, such as IGF-l, that contain appropriately 
formed disulfide bonds, can best be achieved by transit 
through the secretory pathway. 

IGF-l has been cloned into and expressed using Sj. 
gerevAs i ae host cells by introducing DMA encoding IGF-l 
on autonomously replicating extrachromosomal elements. 
Gellerfors et al. ((1989) J. Blm . , 264 . 11444 _ 

11449) describes the production of IGF-l in S. cerev1«n a ,» 
under the control of the ^ cerevisia* actin promoter. 
The IGF-l product is encoded by autonomously replicating 
plasmid-borne DNA. in a similar study, Bayne et al. 
((1988) Gene 66: 235-244) describes the production of 
IGF-l in S^. cerevisiaa under the control of the 
cerevlsjae alpha mating factor promoter. The latter, 
however, reports yields of IGF-l of only about 2 mg of 
IGF-l per liter of fermentation broth. 

In view of this low yield and the problems generally 
encountered with up-scaling the production of 
heterologous proteins in autonomous plasmid-based yeast 
systems, such as loss of selection for plasmid 
maintenance and problems concerning plasmid distribution, 
copy number and stability in fermentors operated at high' 
cell density, there is a need to develop more efficient 
means for producing large quantities of biologically 
35 active IGF-l. 
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Therefore, it is an object of this invention to 
provide host cells and expression vectors that stably 
express IGF-1 and that secrete high concentrations of 
biologically active IGF-1. 
5 It is another object of this invention to provide an 

expression system for the production of biologically 
active IGF-1 that, not only secretes high concentrations 
of biologically active IGF-1, but that can be readily 
scaled up to produce large quantities of such IGF-1. 
10 SUMMARY OF THE IKVEKTION 

Expression systems and method using the expression 
systems for the production of biologically active 
insulin-like growth factor-1 (IGF-1) using methylotrophic 
yeast host cells are provided. The methods of production 
15 are readily scaled up from shake-f lask cultures to large 
scale fermentors with no loss in IGF-1 productivity and 
without the need for making major changes in the 
fermentation conditions used for the growth of the 
transformed strains Methods for isolation and 
20 purification of the IGF-1 product are also provided. 

The expression systems and methods provided herein 
avoid the problems encountered with heterologous protein 
expression in cerevisiae in which high level 
expression can only be achieved by the introduction of 
25 multicopy plasmids into the host cells. 

The expression system described herein uses 
methylotrophic yeast host cells, such as for example, £^ 
PfrstorlP f for the expression of IGF-1. Key features of 
the system include the ability to stably integrate and 
30 express multiple copies of the DNA encoding IGF-1 and the 
DNA encoding the signals that direct secretion and 
processing of the IGF-1 and the ability to properly 
process mature IGF-1 from the expressed precursor form 
IGF-1 and to secrete the resulting mature IGF-1 product. 
35 Another feature of the system resides in selection 
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of the promoter that has been used to control expression 
of the DNA encoding ICP-I. The promoter, which is derived 
from a methanol-responsive gene, such as A0X1, of a 
methylotrophic yeast, is tightly regulated and provides 
for high-level regulated expression of genes placed under 
its control (see, e^, European Patent Application No. 
85113737.2, published June 4, 1986, under No. 0 183 071, 
now issued in the United States as U.S. Patent No. 
4,855,231). 

Expression and secretion of high levels of IGF-l 
peptide has been accomplished by transforming a 
methylotrophic yeast host with a DNA construct that 
contains at least one copy, but may contain as many as 
six or more copies, of DNA encoding an IGF-l peptide in 
which the DNA is operably linked with DNA encoding a 
signal sequence that is effective for directing the 
processing and secretion of the IGF— l peptide product. 
The DNA construct also includes a promoter region, which 
directs expression of the DNA encoding the signal 
sequence and IGF-l peptide, and a transcription 
terminator functional in a methylotrophic yeast. 

The DNA construct provided here, also includes 
sequences of nucleotides that have sufficient homology 
with a target gene in the methylotrophic yeast host cell 
25 genome, to effect stable integration. Integration takes 
place by addition or replacement at the site of the 
target gene. Alternatively, the DNA construct may be 
part of a circular plasmid, which will integrate by 
addition at a site of homology between the host and the 
30 plasmid sequence. 

In accordance with other embodiments, expression 
vector containing the DNA construct which includes at 
least one copy of an expression cassette are provided, 
hereinabove. 

According to another aspect, there are provided 
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novel methylotrophic yeast cells containing in their 
genome at least one copy of the above-described DNA 
fragment. In preferred embodiments the cells are 
deficient in one or more proteases that are not essential 
5 for proper cleavage of the precursor form of IGF-1. 
In preferred embodiments, the host cell is 
pastor is . the promoter is the AOX1 promoter, and the 
signal sequence is the £«_ cerevisiae alpha-mating factor 
(ctMF) pre-pro sequence, which includes a processing 

10 sequence of the formula lys-arg-(glu-ala) x in which x is 
preferably between 0 and 3. Methylotrophic yeast cells 
in which at least one copy of this DNA construct has been 
introduced efficiently produce and secrete biologically 
active IGF-1 peptides into the medium. In preferred 

15 embodiments, IGF-1 has been very efficiently produced in, 
and secreted from the methylotrophic yeast £j_ pastor is . 

The polypeptide product produced by the 
methylotrophic host cells, is secreted into the culture 
medium at high concentrations; the level of IGF-1 

20 peptides secretion is many times the concentrations that 
are reported for the secretion of IGF-1 from Qj_ 
cerevisiae (see, e.g. , Bayne' et al. (1988) Gene . 66: 235- 
244) « IGF-1 peptides are produced by methylotrophic 
yeast cells that contain in their genome one or more 

25 copies of a DNA sequence operably encoding IGF-1 peptides 
operably linked with DNA encoding a signal sequence, such 
as the 8^. cerevisiae a-mating factor (aMF) pre-pro 
sequence, which includes at least one site, such as lys- 
arg, that is sufficient for proper proteolytic cleavage 

30 of mature IGF-1 from the pre-pro-protein, under the 

regulation of a promoter region of a methanol-responsive 
gene of a methylotrophic yeast. In addition, the DNA 
encoding the signal sequence may encode one or more glu- 
ala sequences, which also serve to direct pr per 

35 processing. The number of glu-ala sequences is 
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preferably from 0 to 3, but may include mor c pies. The 
number of copies is limited by the ability of the 
selected host to properly process the pre-pro-protein. 
According to a still further embodiment, there is 
5 provided a process for producing IGF-1 peptides by 

growing methylotrophic yeast transf ormants containing in 
their genome at least one copy of a DNA sequence operably 
encoding an IGF-1 peptide, operably associated with DNA 
encoding the jL. cerevisiae otMF pre-pro sequence. This 
10 pre-pro sequence includes a processing sequence lys-arg 
and may also include from 0 to 3 glu-ala spacer segments. 
Transcription of both the DNA encoding the signal and the 
DNA encoding the IGF-l peptide is under the regulation of 
a promoter region of a methanol-responsive gene of a 
15 methylotrophic yeast. Under conditions that induce the 
promoter transcription of the DNA is induced resulting in « 
expression and processing of the precursor IGF-l and 
secretion of IGF-1 peptides into the culture medium • 

Cultures of viable methylotrophic yeast cells 
capable of producing IGF-1 peptides are also within the 
scope of the present invention. In preferred 
embodiments, the yeast host cells that express and 
secrete IGF-1 have been modified by disruption of one or 
more host cell genes that directly or indirectly 
25 influence host cell protease expression. This disruption 
results in a reduction of some protease activities of the 
host cell, for example, proteinase A and carboxypeptidase 
Y activities. 

A preferred process for recovering and purifying 
30 IGF-1 peptides from £^ pastoris fermentation broth is 
described in U.S. Patent Application Serial No. 
07/641,430. The preferred form of IGF-1 is a 
polypeptide product which exhibits substantially the same 
biological activities as natural insulin-like growth 
factor-l, as measured in recognized bioassays (see, e.g. . 
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Takano et al. (1976) Acta Endocr. 82 : 449-459; Schoenie 
££ &Ll (1982) Mature 296: 252-253; Copeland et al^ (1983) 
Am. J. Primatology 5: 161-169 and Buul-Offers £t al« 
(1986) Ped.Res. gOz 825-827) having substantially the 
5 same amino acid sequence as native IGF-1, Sequence ID No. 
1, in the sequence listings. It is understood that 
polypeptides having some variation in amino acid 
sequences such as lacking one or more amino acids, or 
containing additional amino acids, or having some 
10 substituted amino acids are within scope of the 

invention, provided that these peptides exhibit the 
functional activity of IGF-1 according to known 
bioassays. Such IGF-1 may include, not only 
authentically folded IGF-1, but also the misfolded and 
15 multimeric forms of IGF-1 that exhibit the ability to 
detectably bind to IGF-1 receptors and that demonstrate 
at least one bioactivity associated with IGF-1 . In 
addition, biologically inactive forms may be converted to 
active forms by reduction and oxidation* 
20 BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 provides the nucleotide sequence of a 
synthetic insulin-like growth factor gene. 

Figure 2 is a restriction map of plasmid pIGF20l. 
Figure 3 is a restriction map of plasmid pIGF202. 
25 Figure 4 is a restriction map of plasmid pIGF204. 

Figure 5 is a restriction map of plasmid pIGF206. 
Figure 6 is a restriction map of plasmid pA0815. 
DETAILED DESCRIPTION OF THE INVENTION 
Definitions: 

30 Unless defined otherwise, all technical and 

scientific terms used herein have the same meaning as is 
commonly understood by one of skill in the art to which 
this invention belongs. All publications mentioned 
herein are incorporated by reference theret ♦ All U.S. 

35 patents mentioned herein are incorporated in their 
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entirety by reference thereto. 

As used herein, IGF-l-or an IGF-l peptide is 
intended to include all the allelic variations of IGF-l. 
Moreover derivatives obtained by simple modification of 
the amino acid sequence of the naturally occurring 
product, such as by way of site-directed mutagenesis or 
other standard procedures, are included within the scope 
of the present invention. Forms of IGF-l that exhibit 
similar biological activities to naturally occurring IGF- 
1 are also encompassed by the present invention. IGF-l 
is intended to encompass all forms of IGF-l present in 
the culture medium. Such IGF-l may be mixture of several 
different three-dimensional forms. In addition to 
authentic IGF-l, which is the intact, the monomeric, 
15 correctly folded material, other forms may be present in 
the extracellular medium of the host cells that express 
IGF-l, such as misfolded proteins, multimeric forms such 
as diners, and trimers. It is intended that the IGF-l 
peptide, as used herein, includes any peptide that has 
20 the ability to bind to IGF-l receptors and that exhibits 
at least one activity, such as the ability to promote 
cell proliferation or cell growth in a standard activity 
assay, that is associated with the biological activity of 
IGF-l (see, e^gj., Takano et al^ (1976) Acta Endoer. £2. : 
25 449-459; Schoenie et ai. (1982) Nature 296 ; 252-253; 

copeland £fc ^ (1983) Am. J. v^i-m^r^y s . i 6 i-i 6 9 and 
Buul-Offers fife al,. (1986) Ped.Res. 2J£: 825-827) . 

As used herein, an IGF-l peptide that has biological 
activity is a peptide that specifically binds to IGF-l 
receptors, as detected by a standard assay, and that 
exhibits a biological activity associated with IGF-l as 
measured in a recognized assay. Such assays are known to 
those of skill in this art. 

As used herein, authentic IGF-l refers to IGF-l that 
includes disulfide bonds as they exist in naturally- 
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occurring IGF-l. Authentic IGF-l may be efficiently 
recovered and purified from pastoris fermentation 
broth. 

As used herein, mature IGF-l refers to processed 
5 IGF-l from which the signal and processing sequences have 
been cleaved. Mature IGF-l includes authentic IGF-l and 
any other forms of processed IGF-l that are secreted into 
the medium . IGF-l is intended to encompass peptides that 
have the ability to bind to IGF-l receptors and to 
10 promote cell growth or proliferation as measured by any 
means known to those of skill in the art. 

As used herein, pre-pro-IGF-1 refers to a 
polypeptide that includes a leader or signal sequence 
that effects secretion of mature IGF-l into the 
15 extracellular space of said host and one or more 

processing signals that direct processing of the pre-pro- 
IGF-l to produce the mature IGF-l. As used herein, 
secreted IGF-l refers to processed IGF-l that does not 
include the signal or leader sequence. Processed IGF-l 
20 or processed protein refer to IGF-l or protein from which 
the leader signal has been cleaved. 

As used herein, a signal or leader sequence, which 
expressions are used interchangeably, refers to a 
sequence of amino acids that effects transport of a 
25 linked polypeptide through the cell membrane. A signal 
sequence refers to a sequence of hydrophobic amino acids 
at the amino terminus of the protein to which it is 
linked. DNA encoding a signal sequence is located 
downstream (3' in the direction of transcription) from 
30 the ATG start codon and upstream (5 1 ) from the DNA that 
encodes the structural gene. In addition, the signal 
sequence includes one or mores sequence of amino acids, 
that are recognized by one or more host cell proteases, 
processing sites, int rpos d between the signal sequence 
35 and the protein, whereby removal f the signal may be 
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effected. The signal sequence, processing sites and 
protein are referred to as a pre-pro-protein. 

The signal sequences and processing sites 
contemplated for use herein are those that effect 
transport of IGF-l through the cellular membrane of a 
methylotrophic yeast host cell, such as ^ ESstorjLs. In 
preferred embodiments the signal sequence and processing 
sites, lys-arg and (glu-ala) x , in which x is an integer, 
preferably between 0 and 3, are derived from the ^ 
cereyisjae «MF gene. Any signal sequence and processing 
sites that are effective for secreting mature IGF-1 into 
the extracellular space of a methylotrophic host known to 
those of skill in the art may be used, in preferred 
embodiments, the signal sequences are selected so that 
fermentation cultures of methylotrophic host cells that 
have a cell density of about 300 g/L HeOH and that 
secrete at least about 10 mg/ml of authentic IGF-1 into 
the broth. DNA constructs encoding suitable signal 
sequences and processing sites linked to DNA encoding 
IGF-l may be prepared and introduced into the genome of a 
methylotrophic host by methods known to those of skill in 
the art or described herein and tested for their ability 
to direct secretion of mature IGF-l into the medium. Any 
signal sequence that leads to secretion relatively high 
concentrations, preferably more than lOmg/ml of mature 
IGF-l, is contemplated for use herein. 

As used herein, heterologous DNA includes DNA that 
does not occur naturally as part of the genome in which 
it is present or which is found in a location or 
locations in the genome that differs from that in which 
it occurs in nature. Heterologous DNA is generally not 
endogenous to the cell into which it is introduced, but 
has been obtained from another cell. Generally, although 
not necessarily, such DNA encodes RNA and proteins that 
are not normally produced by the cell in which it is 
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expressed. Heterologous DNA may also be referred to as 
foreign DNA. Any DNA that one of skill in the art would 
recognize or consider as heterologous or foreign to the 
cell in which is expressed is herein encompassed by 
5 heterologous DNA. Examples of heterologous DNA include, 
but are not limited to, DNA that encodes IGF-1, 
transcriptional and translational regulatory sequences, 
selectable or traceable marker proteins, such as a 
protein that confers drug resistance. 

10 As used herein expression cassette refers to a DNA 

construct that includes sequences functional for both the 
expression and the secretion of IGF-1. Accordingly an 
expression cassette includes DNA encoding a promoter 
region, DNA encoding a transcription terminator region, 

15 sequences sufficient for translation, secretion and 

proper processing of the expressed peptide. In addition, 
in preferred embodiments the expression cassette includes 
sequences at 5' and 3 1 ends that are derived from target 
locus in the host cell genome, whereby, upon introduction 

20 into the host cell, the expression cassette is stably 
integrated into the host cell genome. 

As used herein, the term DNA construct embraces 
expression cassettes and also includes DNA fragments that 
include more than one expression cassette. 

25 As used herein, the term operative linkage or 

operably associated refers to relationship among elements 
of a DNA construct in which elements are arranged such 
regulatory sequences of nucleotides that are part of the 
construct directly or indirectly control expression of 

30 DNA in the construct that includes protein- or peptide- 
encoding sequences of nucleotides. 

As used herein, the term "a DNA fragment operably 
encoding IGF-1 peptides 11 includes DNA fragments encoding 
IGF-1 or any other " IGF-1 peptide" as defined 

35 h reinab ve. DNA encoding IGF-1 is known in the art and 
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may be obtain d by chemical synthesis or by transcription 
of messenger RNA (mRNA) corresponding to IGF-l into 
complementary DNA (cDNA) and converting the latter into a 
double stranded cDNA. Chemical synthesis of a gene for 
5 human IGF-l is, for example, disclosed by Niwa et al. 
(1986) Annals of the NY Academy of Rnimnm r 469 : 31-52, 
and Buell et 'al, (1985) Nucleic A cids Research , j^: i 92 3- 
1938, The requisite DNA sequence can also be removed, 
for example, by restriction enzyme digest of known 

10 vectors harboring the IGF-l gene. Examples of such 
vectors and the means for their preparation are well 
known to those of skill in the art. See, e.g. . Niwa 
£1*. (1986) Annals of the NY A cademy of Science . 469 ; 31- 
52, and Buell gfr frj, (1985) Nucleic Acids Research , 13 : 

15 1923-1938. The nucleotide sequence of a presently 

preferred IGF-l gene used in accordance with the present 
invention is illustrated in FIG. 1 and is further 
elucidated in the examples. 

As used herein, the term expression vector is 

20 intended to include vectors capable of expressing DNA 
that are in operational association with other sequences 
capable of effecting their expression such as promoter 
sequences in a selected host cell. In general, 
expression vectors usually used in recombinant DNA 

25 technology are often in the form of "plasmids", which are 
circular, double-stranded DNA loops, extrachromosbmal 
elements. As used herein, the terms "vector" and 
"plasmid" are used interchangeably and are not intended 
to be limited, but to include any expression vectors or 

30 means that permits heterologous DNA to be expressed in a 
particular host cell. 

As used herein, the term "culture" means a 
propagation of cells in a medium conducive to their 
growth, and all sub-cultures thereof. The term 

35 "subculture" refers to a culture of cells grown from 
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cells of another culture (source culture), or any 
subculture of the source culture, regardless of the 
number of times subculturing has been performed between 
the subculture of interest and the source culture. 
5 The amino acids which occur in the various sequences 

of amino acid set forth in the specification have their 
usual, three- and one-letter abbreviations, routinely 
used in the art: 





Amino Acid 


Abbreviation 


10 


L-Alanine 


Ala 


A 




L-Arginine 


Arg 


R 




L-Asparagine 


Asn 


N 




L-Aspartic acid 


Asp 


D 




L-Cysteine 


Cys 


C 


15 


L-Glutamine 


Gin 


Q 




L-Glutamic Acid 


Glu 


E 




L-Glycine 


Gly 


G 




L-Histidine 


His 


H 




L-Isoleucine 


He 


I 


20 


L- Leucine 


Leu 


L 




L-Lysine 


Lys 


K 




L-Methionine 


Met 


M 




L-Phenylalanine 


Phe 


F 




L-Proline 


Pro 


P 


25 


L-Serine 


Ser 


S 




L-Threonine 


Thr 


T 




L-Tryptophan 


Trp 


W 




L-Tyrosine 


Tyr 


Y 




L-Valine 


Val 


V 



30 

Host cells. 



Yeast species contemplated for use in the practice 
of the present invention are methylotrophs, that is 
species which are able to grow on methanol as carbon 

35 sources. Species which have the biochemical pathways 

necessary for methanol utilization fall into four genera, 
Candida, Hansenula, pjlchia, and Tffr\flPP?*s. Of these, a 
substantial amount is known about the molecular biology 
of members of the species flanseymla polvmorpha and Pichia 

40 pastoris . 

P. pastoris is the presently preferred yeast 
species. pastoris is a known industrial yeast strain 
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that is capable of fficiently utilizing methanol as the 
sole carbon and energy source. 
DMA constructs. 

The DNA constructs or expression cassette used for 
5 transforming methylotrophic yeast cells contain a 

methanol-responsive promoter from a methylotrophic yeast 
gene, DNA encoding IGP-l- (IGP-1 gene), a encoding a 
signal sequence in-reading frame with the gene, and 
and a transcription terminator functional in a 
10 methylotrophic yeast. The signal sequence may be any 

such sequence that functions to direct the secretion from 
and proper processing of IGF-l in the yeast host cell. 
The Sj. cerevisjae «MF pre-pro sequence is the preferred 
signal sequence. The aMF pre-pro sequence includes a 
15 DNA sequence encoding a processing sequence lys-arg, and 
(glu-ala) x spacer sequences wherein X is o, l, 2 or 3. 

The cerevisiae alpha-mating factor is a 13- 
residue peptide, secreted by cells of the "alpha" mating 
type, that acts on cells of the opposite "a" mating type 
20 to promote efficient conjugation between the two ceil 
types and thereby formation of "a-alpha" diploid cells 
(Thorner et al. (1981) The Molecul ar Biology the Yeast 
SaccharpmYcefi , cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY, 143) . The a-factor is synthesized as a 165 
25 amino acid long precursor that contains an 83 residue 

leader and four a-factor peptide regions. Each region is 
preceded by a short spacer peptides of sequences: lys-arg- 
glu-ala-asp-ala-glu-ala; lys-arg- (glu-ala) s or lys-arg- 
(glu-ala) 2 . The leader and spacer includes the amino acid 
sequences for proteolytic processing and secretion (see, 
&±Sj_, Brake et aj. t , Proc. Nat l. Acari. s c i. PSA . 81: 4642 
(1984)). 

There are a number of methanol-responsive genes in 
methylotrophic yeast. The expression of each is 
controlled by methanol-responsive regulatory regions, 



30 



35 



also referred to as promoters. Any of such methanol- 
responsive promoters are suitable for use in the DNA 
constructs. Examples of specific regulatory regions 
include, but are not limited to, the promoter for the 
primary alcohol oxidase gene from Pichia pastoris A0X1, 
the promoter for the secondary alcohol oxidase gene from 
El. pastoris A0X2 , the promoter for the dihydroxyacetone 
synthase gene (DAS) from Ex pastoris . the promoter for 
the P40 gene from Es. pastoris the promoter for the 
catalase gene from Ei. pastoris . Selection of suitable 
promoter regions and other regulatory regions is, in 
light of this disclosure, within the level of skill in 
the art. 

The presently preferred promoter region employed to 
drive IGF-1 gene expression is derived from a methanol- 
regulated alcohol oxidase gene of E*. pastoris . P. 
pastoris is known to contain two functional alcohol 
oxidase genes: alcohol oxidase I ( AOXl l and alcohol 
oxidase II f A0X2 ) genes. The coding portions of the two 
AOX genes are closely homologous at both the DNA and the 
predicted amino acid sequence levels and share common 
restriction sites. The proteins expressed from the two 
genes have similar enzymatic properties but the promoter 
of the AQXl gene is more efficient and provides for 
higher levels of gene expression; therefore, its use is 
preferred for IGF-1 expression* The AQXl gene, including 
its promoter, has been isolated and thoroughly 
characterized; see Ellis fit aj y (1985) Mol. Cell. Biol. 
5: llll and U.S. Patent No. 4,855,231. 

The DNA construct that is introduced into 
methylotrophic yeast cells contains, in addition to a 
methanol-responsive promoter of a methylotrophic yeast 
gene and DNA encoding IGF-1-, and DNA encoding an in- 
reading frame signal and processing sequence, c ntains a 
transcription terminat r functi nal in a methyl tr phic 
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yeast. The transcription terminator functional in a 
methylotrophic yeast used herein has either (a) a 
subsegment which encodes a polyadenylation signal and 
polyadenylation site in the transcript, and/or (b) a 
5 subsegment which provides a transcription termination 
signal for . transcription from the promoter used in the 
expression cassette. The entire transcription terminator 
is taken from a protein-encoding gene, which may be the 
same or different from the gene which is the source of 

10 the promoter. 

The DNA construct may also contain a selectable 
marker gene. For this purpose, any selectable marker 
gene functional in methylotrophic yeast may be employed, 
including, but not limited to, any gene which confers a 

15 selectable phenotype upon methylotrophic yeast permitting 
such yeast cells to be identified and selectively grown 
from among a vast majority of untransformed cells. 
Appropriate selectable marker systems, for example, would 
be an auxotrophic mutant pastoris host strain and a 

20 wild type biosynthetic gene which complements the host's 
defect. For example, the 2*. cerevisiae or ^ Pastoris 
BIS4 gene may be used to complement His4~ pastoris 
strains, or the S-_ cerevisiae ARG4 gene or the P, 
pastoris ARG4 gene may be employed to complement Arg4~ 

25 mutants. 

In addition, the DNA constructs may additionally 
include selectable marker genes which are functional in 
bacteria. Thus, any gene can be used which confers a 
phenotype on bacteria that allows transformed bacterial 

30 cells to be identified and selectively grown from among a 
vast majority of untransformed cells. This additional 
selectable marker permits such DNA to be introduced into 
and amplified in bacterial" host cells, such as £^ coli . 
Suitable selectable marker genes include the ampicillin 

35 resistance gene (Amp r ) , tetracycline resistance gene 
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(Tc r ), and the like. 

The DNA construct may Include sequences allowing for 
replication and selection in bacteria, especially E. 
coli . In particular, the DNA construct may be inserted 
5 into a plasmid or it may be circularized to form a 
plasmid that includes sequenes for replication and 
extrachromal maintenance in bacteria. In this way, large 
quantities of the DNA fragment can be produced by 
replication in bacteria. When such amplification is 

10 desired, the DNA construct should include bacterial 

origin of replication, to ensure the maintenance of the 
DNA construct from generation to generation of the 
bacteria. Exemplary bacterial origins of replication 
include the f 1-ori, colisin, col El, and others that are 

15 known to or that can be identified by those of skill in 
the art. 

The DNA construct may include multiple copies of the 
DNA encoding IGF-1 operatively linked to the signal and 
processing signals, and other transcriptional regulatory 
20 signals, i.e. . the DNA construct may include multiple 
copies of the expression cassette. 

Transformation of methylotrophic yeast host cells with 
the DNA constructs. 

Methods for transforming methylotrophic yeast, such 

25 as, for example, £*. nastoris . as well as methods for 

culturing methylotrophic yeast cells containing in their 

genome a gene encoding a heterologous protein, are 

generally known in the art. The expression cassettes may 

e introduced into the methylotrophic yeast cells by any 

30 method known to those of skill in the art. Preferred 

methods include the spheroplast technique, described by 

Cregg et ai^ (1985) Mol. Cell. Biol. £: 3376 and U.S. 

Patent No. 4,879,231, and the whole-cell lithium chloride 

yeast transformation system (Ito et al. (1984) Acrrlc. 

35 Biol. Chem. 48 ; 341) , with modification necessary for 
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adaptation t methyl trophic yeast such as pastoris 
(See European Patent Application No. 312,934; also 
available as U.S. Patent No, 4,929,535). 

If the yeast host is transformed with a linear DNA 
5 fragment containing the IGF-1 gene and DNA encoding aMF 
pre-pro sequences necessary for processing and secretion 
under the regulation of a promoter region of a P. 
pastoris gene, the expression cassette is integrated into 
the host genome by any of the gene replacement techniques 

10 known in the art, such as by one-step gene replacement 
(see e.g., Rothstein (1983) Methods En z vino 1, lQi ; 202; 
Cregg fil^ (1987) Bio /Technology Si 479; and U.S. 
Patent No. 4,882,279) or by two-step gene replacement 
methods (see e.g., Scherer and Davis (1979) Proc. Natl. 

15 Acad. Set. USA, 2£: 4951) . .The linear DNA fragment is 
directed to the target gene by flanking DNA sequences 
having sufficient homology with the target gene to effect 
integration of the DNA fragment therein. One-step gene 
disruptions are usually successful if the DNA to be 

20 introduced has as little as 0.2 kb homology with the 
fragment locus of the target gene; it is preferable 
however to maximize the degree of homology for 
efficiency. 

In preferred embodiments, multiple copies of these 
25 expression cassettes are included on one DNA fragment, 
preferably in a head-to-tail orientation. 

If the DNA fragment is included in a circular 
plasmid, the plasmid may be integrated by addition rather 
than gene disruption into the genome. One or more copies 
30 of the plasmid can be integrated at the same or different 
loci of the genome. Integration into the genome is 
facilitated by linearization of the plasmid by means of a 
suitable restriction endonuclease. 

The DNA fragments of the expression cassette (s) are 
35 said to be "operably associated" with one another. The 
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DNA sequence encoding IGF-1 peptides is positioned and 
oriented functionally with respect to the promoter, the 
DNA sequence encoding the £2. cerevisiae aMF pre-pro 
sequence and the transcription terminator. Thus, the 
5 polypeptide-encoding segment is transcribed under 
regulation of the promoter region, into a transcript 
capable of providing, upon translation, the desired 
polypeptide. Because of the presence of the aMF pre-pro 
sequence, the expressed IGF-1 product is secreted into 
10 the culture medium* Appropriate reading frame 

positioning and orientation of the various segments of 
the expression cassette are within the knowledge of 
persons of ordinary skill in the art; further details are 
given in the Examples. 

15 Positive transformants may be isolated, identified 

and characterized by any method known to those of skill 
in the art, including Southern blot analysis (Maniatis et 
aii (1982) Molecular Cloning; A Laboratory Manual . Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New 

20 Xork, USA) to determine, for example, the site of DNA 
integration; Northern blots (Maniatis et al. (1982) 
EQlegVflar Cloning; A Laboratory Manual. Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York, 
USA ; R.S. Zitomer and B.D. Hall (1976) J. Biol. Cham . 

25 251 : 6320) , for example, to confirm methanol-responsive 
IGF-l gene expression; and product analysis to detect the 
presence of secreted IGF-1 peptides in the growth media. 

The presently preferred host cells to be transf ormed 
with multiple expression cassettes are pastor is cells 

30 having at least one mutation that can be complemented 
with a marker gene present on a transforming DNA 
fragment* Preferably His4" (GS115) or Arg4" (GS190) 
auxotrophic mutant Ee_ nastoris strains are employed. The 
most preferred strains are additionally deficient in 

35 expression of one r more pr teases that are not required 
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for proper pr cessing of the selected leader sequence. 
The def iency may be manifested as an absence of or a 
decrease in the activity of one or more proteases by 
virtue of point mutations, insertions, or deletions in 
5 genes that encode proteases or genes that directly or 
indirectly modulate expression of genes that encode 
proteases . 

In preferred embodiments, DNA fragments containing 
one or more expression cassette (s) are inserted into a 
10 vector containing a marker gene complementing the host's 
defect, and optionally containing additional sequences 
such as bacterial marker genes, yeast sequences which 
direct vector integration, and the like. Plasmids based 
on pBR322, such as pA0815, are preferred as vectors. 
15 Insertion of one or more copies of the IGF-1 

expression/secretion cassette into parent plasmid pA0815 
produces plasmids such as pIGF201, pIGF202, pIGF204, 
PIGF206, and pIGF816. 

Protease-deficient strains of ^ pastorls , known as 
pe P 4- or pep4" prb-r strains, described below and in U.S. 
Patent Application Serial No. 07/678,916, are among the 
preferred host cells. Protease-deficient P^. pastor is 
strains are generated by disruption of pastoris genes, 
such as the E£E± gene or PRB-i gen e, that encodes a 
protein that directly or indirectly affects the protease 
activity of the cell. This disruption is preferably 
accomplished by the insertion of an additional plasmid, 
unrelated to the constructs for expression of IGF-i, into 
the EEEA gene or PBR-1 gene. This disruption results in 
30 a reduction of some of the protease activities in the 
cells, such as, for example, proteinase A, proteinase B 
and carboxypeptidase Y activities. This host cell strain 
preferably also contains a mutation which can be 
complemented by a marker gene carried on the expression 
35 cassette, as discussed below in the Examples. 
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The preferred host cells may be transformed with the 
expression vectors that are introduced into targeted 
specific sites in the host cell genome in order to 
generate Mut" expression strains of £^ pastoris . "Mut" 
5 refers to the ©ethanol-utilization phenotype. In certain 
embodiments, Mut" strains are prepared by first digesting 
the selected expression vector with an appropriate enzyme 
to yield a linear DNA fragment with ends homologous to 
the 5 1 and 3' ends of the A0X1 gene* The linearized DNA 

10 construct expression cassette (s) is then integrated into 
the host genome at the AQX1 site by a one-step gene 
replacement technique. As a result of gene replacement 
of the AOX1 . Mut" strains are obtained* In Mut" strains, 
strains 1 ability to utilize methanol is decreased* A 

15 slow growth rate of the strain on methanol is maintained 
by expression of the gene product. The 

transformants in which the expression cassette has 
integrated into the A0X1 locus by site-directed 
recombination can be identified by first screening for 

20 the presence of a complementing marker gene present in 

the expression cassette. This is preferably accomplished 
by growing the cells in medium lacking the complementing 
gene product and identifying those cells which are able 
to grow by virtue of the expression of the complementing 

25 gene. Next, the selected cells are screened for their 
"Mut" phenotype by growing them in the presence of 
methanol and monitoring their growth rate. Mut" 
phenotypes have much slower growth on methanol than the 
parent strain. 

3 0 To develop Mut* IGF-l-expressing strains, the 

fragment that contains one or more expression cassette (s) 
preferably is integrated into the host genome by 
transformation of the host with a circular plasmid or a 
linearized plasmid containing the DNA c nstruct. The 

35 integration is by addition at a 1 cus or loci having 
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homology with one or more sequences present in the 
fragment. 

Positive transformants may be characterized by 

Southern analysis to identify, for example, the site of 

5 DMA integration; by Northern analysis to detect, for 

example, for methanol-responsive IGF-1 gene expression; 

and by product analysis to detect the presence of 

secreted IGF-1 peptides in the growth medium. 

Culturing transformed yeast host cells that harbor the 
10 DMA constructs. 

The transformed host cells are cultured under 
conditions in which the DKA encoding IGF-1, signal and 
processing signals, is expressed, the pre-pro-peptide is 
processed and secreted. Transformed strains, of the 

15 desired phenotype and genotype, may be grown in large 
quantities by any means known to those of skill in the 
art. In preferred embodiments, the strains are cultured 
in fermentors. For the large-scale production of 
recombinant DNA-based products in methylotrophic yeast, a 

20 three-stage, high cell-density, fed batch fermentation 
system is normally the preferred fermentation protocol 
employed. 

In the first, or growth stage, expression hosts are 
cultured in defined minimal medium with an excess of a 

25 non-inducing carbon source, such as but not limited to, 
glycerol. When grown on such carbon sources, 
heterologous gene expression is completely repressed, 
which allows the generation of cell mass in the absence 
of heterologous protein expression. It is presently 

30 preferred, during this growth stage, that the pH of the 
medium be maintained at about 5. The second stage, 
referred to as a period of growth under limiting 
conditions, refers to a short period of non-inducing 
carbon source limitation growth. In this stage, cell 

35 mass continues to increase and the methanol-responsive 
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promoter is derepressed. Although a pH of about 5 is 
preferred for optimal growth of £_£_ pastoris , undetectably 
or or low levels of IGF-1 are secreted by proteolytically 
intact IGF-l-expressing strains that are cultured 
5 entirely at pH 5. High concentrations of IGF-1 are 
produced, however, if the pH of the medium during this 
limitation growth period is adjusted to about 4 or less, 
preferably in the range of about 2 - 3.5. This pH is 
maintained during the production phase. 

10 Expression of IGF-1 from protease deficient strains 

is less sensitive to pH than expression from 
proteolytically intact strains. Protease deficient 
strains may be cultured entirely at pH 5*. 
Alternatively, and preferably, protease deficient strains 

15 are cultured at pH 5 during the first two stages, after 
which the pH is lowered to about 2.5 to about 3.0 during 
the third stage. 

In the third stage, the production stage, either 
methanol alone, referred to -herein as "methanol excess 

20 fed-batch mode", or a limiting amount of a non- inducing 
carbon source plus methanol, referred to herein as 
"mixed-feed fed-batch mode"; are added in the fermentor. 
Addition of methanol induces expression of genes, 
including IGF-1, that are in operative linkage with 

25 methanol-responsive promoters. 

According to a preferred embodiment, the 
heterologous protein expression system used for IGF-1 
production utilizes the promoter derived from the 
methanol-regulated AOX1 gene of P*. pastor is . which is 

30 tightly regulated and provides for efficient gene 
expression. This gene can be the source of the 
transcription terminator as well. The presently 
preferred expression cassette comprises, in operational 
association with one another, the Rj_ pastoris AOX1 

35 promoter, DNA encoding the S„ cereyjlsjlae otMF pre-pro 
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sequence including the DNA sequence encoding the 
processing sequence lys-arg-(glu-ala) x , wherein x may vary 
from 0 to 3, a DNA sequence encoding IGF-1 polypeptides, 
and a transcription terminator derived from the 1U. 
5 pastoris A0X1 gene. Preferably, two or more of such 
expression cassettes are contained on one DNA fragment, 
in head-to-tail orientation, to yield multiple expression 
cassettes on a single contiguous DNA fragment. 

Selected methylotrophic yeast transf ormants that 

10 have the desired genotype and phenotype are cultured in 
f ermentors under conditions in which IGF-1 is expressed 
and secreted into the culture medium. It is presently 
preferred to use the three-step production process 
described above. The level of IGF-1 secreted into the 

15 media can be determined by Western blot analysis of the 
media in parallel with an IGF-1 standard, using anti-lGF- 
1 antisera; by radioimmunoassay (RIA) ; by radio receptor 
assay; or by HPLC after suitable pretreatment of the 
medium. 

20 As noted above, IGF-1 as used herein is intended to 

embrace all the allelic variations of IGF-1. Moreover, 
as noted above, derivatives obtained by simple 
modification of the amino acid sequence of the naturally 
occurring product, e.g. . . by way of site-directed 

25 mutagenesis or other standard procedures, are included 
within the scope of the present invention. Forms of IGF- 
1 produced by proteolysis of host cells that exhibit 
similar biological activities to naturally occurring IGF- 
1 are also encompassed by the present invention. 

30 The following examples are included for illustrative 

purposes only and are not intended to limit the scope of 
the invention. 

EXAMPLES 

£j_ pastoris is described herein as a model system 
35 for the use of methylotrophic yeast hosts. Methylotrophic 
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yeast host cells from four g nera: Candida, Hansenjala, 
Pichia and y«arulopsis may also be used. Host cells from 
any species that demonstrably grows on methanol as the 
sole carbon nutrient may be used (see, e^su, Gleeson e£ 
5 sl^. (1988) yeast ±: 1) . Such species may be used as 
described herein for L. ESSifiEiE- 

EXAMPLE 1 

CONSTRUCTION OP THE PICHIA PASTORIB EXPRESSION VXCTOR8 
USED IK THE PREPARATION OF THE IOP-1 EXPRESSION VECTORS. 

10 A. construction of plasmid pAO203. 

Plasmid pAO203 contains a 5* £coRI-3 •Hindlll DNA 
fragment encoding the ctMF pre-pro region, the amino acids 
of protease processing sites, lys-arg and (glu-ala) 2 and 
the AOX1 transcription terminator. 

15 The AOX1 transcription terminator was isolated from 

pPG2.0. Plasmid pPG2.0 had been prepared by inserting 
the BamHI-Hindlll fragment of pG4.0 (NRRL 15868), which 
contains the AOX1 terminator, into pBR322. 20 /xg of 
pPG2.0 was digested with stul, followed by the addition 

20 of 0.2 Mg Sai l linkers (GGTCGACC) . The plasmid was 
subsequently digested with Hindlll and the 350 bp 
Hind lll-sall fragment was isolated from a 10% acrylamide 
gel and subcloned into pUCl8 (Boehringer Mannheim) that 
had been digested with Hindlll and Sail. The ligation 

25 mix was transformed into JM103 cells and Amp" colonies 
were selected. The correct construction was verified by 
Hindlll and sail digestion, which yielded a 350 bp 
fragment, and was called pA0201. 

5 fig of pA0201 was digested with Hind.HI, filled in 

30 using £j- coli DNA Polymerase I Klenow fragment, and 0.1 
itg of Iglll linkers (GAGATCTC) were added. After 
digestion of the excess figlll linkers, the plasmid was 
reclosed and transformed into MC1061 cells. Amp* ceils 
were selected, DNA was prepared, and the correct plasmid 

35 was verified by Balll. gall double digests, which yielded 
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the expected 350 bp fragment, and by a Hlnd lll digest, 
which shoved loss of a Hindlll site. This plasmid was 
designated pAO202. 

An alpha factor-GRF fusion was isolated as a 360 bp 
5 BamH I- Pst I partial digest from pYSV201. Plasmid pYSV201 
had been prepared by inserting the EcoR I -BamH I fragment 
of GEF-E-3 , which is described in European Patent 
Application No, 206,783, into M13rapl8 (New England 
Biolabs) . 20 y,q of pYSV201 plasmid was digested with 

10 fiamHI and partially digested with Pst l. The following 
oligonucleotides were added to this partial digest: 
5« AATTCGATGAGATTTCCTTCAATTTTTACTGCA 3 1 (Seq. ID No. 8) 
3' GCTACTCTAAAGGAAGTTAAAAATG 5« (Seq. ID No. 9) . 

Only the antisense strand of the oligonucleotide was 

15 kinase labelled so that the oligonucleotides did not 
polymerize at the 5 1 - end. A 350 bp fragment was 
isolated by electroelution following acrylamide gel 
electrophoresis (10%) . This BcoRI- BamH l fragment of 385 
bp was cloned into pA0202 which had been cut with EcoR I 

20 and SafiHI. MC1061 cells were transformed and Amp r cells 
were selected. The resulting plasmid, pA0203, was 
identified by cutting with JScoRl and Eglll to yield a 
fragment of greater than 700 bp. 

B. Construction of plasmid pA0807. 

25 l. Preparation of fl-ori DNA. 

Bacteriophage f 1 DNA (50 pq) was digested with 50 
units of Rsa l and Dra l to release the «458 bp DNA 
fragment containing the f 1 origin of replication (ori) . 
The digestion mixture was extracted with an equal volume 

30 of phenol: chloroform (V/V) followed by extracting the 
aqueous layer with an equal volume of chloroform and 
finally the DNA in the aqueous phase was precipitated by 
adjusting the NaCl concentration to 0.2M and adding 2,5 
volumes of absolute ethanol. The mixture was allowed to 

35 stand on ice (4°C) for 10 minutes and the DNA precipitate 
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was collected by centrifugation for 30 minutes at 10,000 
x g in a microfuge at 4°C. 

The DNA pellet was washed 2 times with 70% aqueous 
ethanol. The washed pellet was vacuum dried and 
5 dissolved in 25 fil of TE buffer (1.0 mM EDTA in 0.01 M 
(pH 7.4) Tris buffer). This DNA was electrophoresed on 
1.5% agarose gel and the gel portion containing the »458 
bp fl-ori fragment was excised out and the DNA in the gel 
was electroeluted onto DE81 (Whatman) paper and eluted 
10 from the paper in 1H NaCl. The DNA solution was 

precipitated as detailed above and the DNA precipitate 
was dissolved in 25 Ml of TE buffer (f 1-ori fragment) . 

2. Cloning of f 1-ori into Dra l sites of 

PBR322 . 

15 Plasmid pBR322 (2 y.q) was partially digested with 2 

units of ficai „ The reaction was terminated by 
phenol : chloroform extraction followed by precipitation of 
DNA as detailed in step 1 above. The DNA pellet was 
dissolved in 20 /xl of TE buffer. About 100 ng of this 

20 DNA was ligated with 100 ng of fl-ori fragment (step l) 
in 20 pi of ligation buffer by incubating at 14 °C for 
overnight with 1 unit of T4.DNA ligase. The ligation was 
terminated by heating to 70°C for 10 minutes and then 
used to transform 2L. coli strain JM103 (Janisch-Perron et 

25 (1983) Gene 221 103). Amp* transformants were pooled 

and super infected with helper phage R408. Single 
stranded phage were isolated from the media and used to 
reinfect JM103. Amp R transformants contained pBRf 1-ori 
which contains fl-ori cloned into the Dra I sites 

30 (nucleotide positions 3232 and 3251) of pBR322. 

3. construction of the plasmid pA0807. 
Plasmid pBRf 1-ori (10 f*g) was digested for 4 hours 

at 37 °C with 10 units each of Pst I and Nde I. The 
digested DNA was phenol: chloroform extracted, 
35 precipitat d and dissolved in 25 /il of TE buffer, as 
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detailed in step 1 above. This material was 
electrophoreses on a 1.2% agarose gel and the Nde I - Pst 
I fragment (approximately 0.8 kb) containing the fl-ori 
was isolated and dissolved in 20 /il of TE buffer as 
5 detailed in step 1 above. About 100 ng of this DNA was 
mixed with 100 ng of pA0804, which had been prepared as 
described in published International Patent Application 
No. WO 89/04320, May 18, 1989 and below and which had 
been digested with £stl and Nde l and phosphatase-treated. 

10 This mixture was ligated in 20 pi of ligation buff er by 
incubating overnight at 14 Q C with l unit of T4 DNA 
ligase. The ligation reaction was terminated by heating 
at 70*C for 10 minutes. This DNA was used to transform 
JSi coli strain JM103 to obtain pA0807. 

15 C Construction of plasmid pA0804: 

Plasmid pA0804 has been described in published 
International Patent Application No. WO 89/04320, May 18, 
1989. Construction of this plasmid involved the 
following steps: 

20 Plasmid pBR322 was modified as follows to eliminate 

the EcoRI site and insert a BolI I site into the Pvu II 
site. Plasmid pBR322 was digested with IcqRI, the 
protruding ends were filled in with Klenow Fragment of 
DNA polymerase I, and the resulting DNA was 

25 recircularized using T4 ligase. The recircularized DNA 
was used to transform JL. coll MC1061 to ampicillin- 
resi stance and transformants were screened for having a 
plasmid of about 4.37 Jcbp in size without an EcoR I site. 
One such transformant was selected and cultured to yield 

30 a plasmid, designated pBR322ARl, which is pBR322 with the 
IcoRl site replaced with the sequence: 
5 1 -GAATTAATTC-3 • 
3 ■ -CTTAATTAAG-5 f . 

Plasmid pBR322ARI was digested with Pvull r and the 
35 linker having the sequence: 
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5 1 -CAGATCTG-3 1 
3 • -GTCTAGAC-5 1 
was ligated to the resulting blunt ends employing T4 
ligase. The resulting DMAs were recircularized r also 
5 with T4 ligase, and then digested with Bql ll and again 
recircularized .using T4 ligase to eliminate multiple 
Bql ll sites due to ligation of more than one linker to 
the Eyul I -cleaved pBR322ARI. The ONAs, treated to 
eliminate multiple Bql ll sites, were used to transform E*. 

10 coli HC1061 to ampicillin-resistance. Transf ormants were 
screened for a plasmid of about 4.38 kbp with a Bql ll 
site. One such transformant was selected and cultured to 
yield a plasmid, designated pBR3 2 2 aRIBGL , for further 
work. Plasmid pBR322 aRIBGL is the same as pBR322ARI 

15 except that pBR3 2 2 aRIBGL has the sequence 

5' -CAGCAGATCTGCTG-3 1 (Sequence ID No. 11) 
3 ■ -GTCGTCTAGACGAC-5 ■ (Sequence ID No . 12 ) 
in place of the Pvu II site in pBR322ARI. 

Plasmid pBR3 2 2 ARIBGL was digested with a Sai l and 

20 Salll and the large fragment (approximately 2.97 kbp) was 
isolated. Plasmid pBSAGISI, which is described, for 
example, in European Patent Application Publication No. 
0 226 752, was digested completely with Bql ll and Xho l 
and an approximately 850 bp fragment from a region of the 

25 pastoris AOX1 locus downstream from the AOX1 gene 
transcription terminator (relative to the direction of 
transcription from the A0X1 promoter) was isolated. The 
Bol II -Xho I fragment from pBSAGISI and the approximately 
2.97 kbp, Sal l -BqI II fragment from pBR3 2 2 aRIBGL were 

30 combined and subjected to ligation with T4 ligase. The 
ligation mixture was used to transform coll HC1061 to 
ampicillin-resistance and transformants were screened for 
a plasmid of the expected size (approximately 3.8 kbp) 
with a Bal ll site. This plasmid was designated pA0801. 

35 The overhanging end of the .Sail site from the 



WO 92/04363 PCT/US9 1/06452 



pBR3 2 2 aRIBGL fragment was ligated to the overhanging end 
of the Xho l site on the 850 bp pBSAGI5I fragment and, in 
the process, both the Sai l site and the Xhol site in 
pAOSOl were eliminated. 
5 Plasmid pBSAGISI was then digested with Cla l and the 

approximately 2.0 kbp fragment was isolated. The* 2.0 kbp 
fragment has an approximately 1.0-kbp segment which 
comprises the iu. pastorls AOX1 promoter and transcription 
initiation site, an approximately 700 bp segment encoding 

10 the hepatitis B virus surface antigen (HBsAg) and an 
approximately 300 bp segment which comprises the P- 
pastoris A0X1 gene polyadenylation signal and site- 
encoding segments and transcription terminator. The 
HBsAg coding segment of the 2.0 kbp fragment is 

15 terminated, at the end adjacent the 1.0 kbp segment with 
the AOX1 promoter, with an EcoR I site and, at the end 
adjacent the 300 bp segment with the AOX1 transcription 
terminator, with a StuI site, and has its subsegment 
which codes for HBsAg oriented and positioned, with 

20 respect to the 1.0 kbp promoter-containing and 300 bp 

transcription terminator-containing segments, operatively 
for expression of the HBsAg upon transcription from the 
A0X1 promoter. The EcoR I site joining the promoter 
segment to the HBsAg coding segment occurs just upstream 

25 (with respect to the direction of transcription from the 
A0X1 promoter) from the translation initiation signal- 
encoding triplet of the AOX1 promoter. 

Plasmid pA0801 was cut with Cla l and mixed with T4 
ligase and the approximately 2.0 kbp clal-site-terminated 

30 fragment from pBSAGI5I. The ligation mixture was used to 
transform E^ coli HC1061 to ampicillin resistance, and 
transformants were screened for a plasmid of the expected 
size (approximately 5.8 kbp) which, upon digestion with 
Cla l and Bglll, yielded fragments of about 2.32 kbp (with 

35 the origin of replication and ampicillin-resistance gene 
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from pBR322) and about 1.9 kbp, 1.48 kbp, and 100 bp. On 
digestion with Eglll and EcoRl, the plasmid yielded an 
approximately 2.48 kbp fragment with the 300 bp 
terminator segment from the AOX1 gene and the HBsAg 
5 coding segment, a fragment of about 900 bp containing the 
segment from upstream of the A0X1 protein encoding 
segment of the A0X1 gene in the AOX1 locus, and a 
fragment of about 2.42 kbp containing the origin of 
replication and ampicillin resistance gene from pBR322 

10 and an approximately 100 bp Cla l -Bal ll segment of the 
AOX1 locus (further upstream from the AOXl-encoding 
segment than the first mentioned 900 bp EcoR I- Bcrll l 
segment) . Such a plasmid had the Cla l fragment from 
pBSAGISI inserted in the desired orientation. Had it 

15 been inserted in the opposite unde sired orientation, 

there would have been EcoR I -Bgl ll fragments of about 3.3 
kbp, 2.38 kbp and 900 bp. One of the transformants 
harboring the desired plasmid, designated pA0802, was 
selected for further work and was cultured to yield that 

20 plasmid. 

Plasmid pA0802 was then treated to remove the HBsAg 
coding segment terminated with an EcoR l site and a StuI 
site. The plasmid was digested with Stu I and a linker of 
sequence: 

25 5 1 -GGAATTCC-3 • 

3«-CCTTAAGG-5 f 
was ligated to the blunt ends using T4 ligase. The 
mixture was then treated with EcoR l and again subjected 
to ligating using T4 ligase. The ligation mixture was 

30 used to transform £^ coli MC1061 to ampicillin resistance 
and transformants were screened for a plasmid of the 
expected size (5.1 kbp) with Eco RI -Bcrl li fragments of 
about 1.78 kbp, 900 bp, and 2.42 kbp and Bcrlll -Cla l 
fragment of about 100 bp, 2.32 kbp, 1.48 kbp, and 1.2 

35 kbp. A transformant with the desired plasmid was 



WO 92/04363 



PCT/US91/06452 



-34- 

s elected for further work and was cultured to yield 
PA0803. 

Plasmid pA0804 was prepared from pA0803 by 
inserting, into the BamHI site from pBR322 in pA0803, an 
5 approximately 2.75 kbp figlll fragment from the £^ 

P ast 9rt? EISA gene. See, e.g. , cregg et al. (1985) Mol. 
Cell, pjoXy i>, 3376 and European Patent Application 
Publication Nos. 180,899 and 188,677. Plasmid pA0803 was 
digested with fiafflHI and combined with the HIS4 gene- 

10 containing Balll site-terminated fragment and the mixture 
subjected to ligation using T4 ligase. The ligation 
mixture was used to transform coli MC1061 to 
ampicillin-resistance and transformants were screened for 
a plasmid of the expected size (7.85 kbp), which is cut 

15 by Sail. One such transformant was selected for further 
work, and the plasmid it harbors was designated pA0804. 

Plasmid pA0804 has one £alI-ClaI fragment of about 
1.5 kbp and another of abut 5.0 kbp and a Cla l -cia T 
fragment of 1.3 kbp, indicating that the direction of 

20 transcription of the HIS4 gene in the plasmid is the same 
as the direction of transcription of the ampiciilin 
resistance gene and opposite to the direction of - 
transcription from the Aoxi promoter. 

The orientation of the HI£4 gene in pA0804 is not 

25 critical to the function of the plasmid or any 

derivatives that include heterologous DNA fragments 
inserted at the IcoRI site between the AOXl promoter and 
terminator fragments. Thus, a plasmid with the HIS4 gene 
in the orientation opposite to that of the HIS4 gene in 

30 pA0804 could be used in place of pA0804. 

D. Construction of plasmid pA0815 

Plasmid pA0815 was constructed by mutagenizing 
plasmid pA0807 to change the Clal site downstream from 
the AOXl transcription terminator in pA0807 into a BamH l 

35 site. The oligonucleotide used for mutagenizing pA0807 
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had the following sequence: 

5"-GAC GTT CGT TTG TGC GGA TCC AAT GCG GTA GTT TAT- 3 • 
(Sequence ID No. 10) « 

The mutagenized plasraid was designated pA0807-Bam. 
5 Plasmid pA0804 (described above and in International 
Patent Application WO 89/04320) was digested with B3III 
and 25 ng of the 2400 bp fragment were ligated to 250 ng 
of the 5400 bp Ball l fragment from Bal ll-diqested pA0807- 
Bean, The ligation mix was transformed into MC1061 cells 
10 and the correct construct was .verified by digestion with 
£§£I/EamHI to identify 6100 and 2100 bp sized bands. The 
correct construct was called pA0815. The restriction map 
of the expression vector pA0815 is shown in Figure 7. 

EXAMPLE 2 

15 CONSTRUCTION OF VECTORS FOR EXPRESSION OF IGF-1 IN F^ 
PASTORIS. 

The expression vector constructions were carried out 

using standard procedures, as described, for example by 

Haniatis £fc al. ((1982) Molecular Cloning : A Laboratory 

20 Manual f Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor, New York, USA) and Davis et al. ((1986) Basic 

Methods in Molecular Biolocrv r Elsevier Science 

Publishing, Inc., New York). 

A* Construction of expression vectors that contain 
25 dna encoding aMF prepro-lys-arg-IOF-i. 

Expression vectors were prepared that contain one or 

multiple copies of the IGF-1 gene expression cassette, 5' 

AOXl -aMF prepro-lys-arg-IGF-1 - 3* A0X1 , in which the aMF 

prepro-encoding element contains the aMF prepro-lys-arg 

30 but lacks glu-ala spacer-encoding nucleotides. 

l. Construction of pIGF20i which contains a 
single copy of the expression cassette. 

The synthetic gene encoding human IGF-1 shown in 

Figure 1 as a Hin dlll -BamH I fragment, was incorporated 

35 into vect r pUC18 and was ;.sed to transform EU_ coli 

strain MC1061. Ampicillin-resistant transformants were 
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selected and screened by examination of restriction 
enzyme-digested DNA for the presence of a Hin dlll-BamHT 
insert of approximately 240 bp, the size expected for the 
IGF-l gene. One transformant with an insert of this size 
5 was used to prepare plasmid DNA designated pIGFlOl. 

The 240 bp Eindlll-SajiHI fragment (250 ng) isolated 
from pIGFlOl, containing the IGF-l gene, was inserted 
into the Bindlll-garaHi site of plasmid pAO203 (10 ng) , 
which contains DNA encoding an EcoR l site at the 5 • end, 

10 the aMF pre-pro region, followed by nucleotides which 
encode the amino acids for three processing sites, lys- 
arg and (glu-ala) 2 , and including a Hindlll site at the 3 f 
end. The resulting plasmid was used to transform coli 
MC1061 cells. Ampicillin-resistant colonies were 

15 selected and screened with an oligonucleotide 

complementary to sequence 162 to 132 of the IGF-l gene, 
Sequence ID No. 1. One colony which was positive in this 
screen was selected and its plasmid designated pIGF102. 
The IcoRI-gamHI fragment from pIGF102 (250 ng) , 

20 containing the <*MF pre-pro region and proteolytic 

processing sites, and IGF-l gene, was cloned into M13mpl9 
(10 ng) and used to transform coli JM103 cells. The 
resulting transf ormants were screened by analysis of 
restriction enzyme-digested DNA, and one clone *(plGF103) 

25 with an insert of the correct size (480 bp EcoRl -BamH i 
fragment) was used to prepare single-stranded DNA for 
site-directed mutagenesis. Site-directed mutagenesis of 
the single-stranded DNA was performed to delete the (glu- 
ala) 2 spacer sites, the HindHI cloning site, the 

30 polylinker attached to the synthetic gene, and the codon 
for the initial methionine of IGF-l. Mutagenesis was 
accomplished using standard procedures and 
oligonucleotides having the following sequences: 

autagenizing oligonucleotide (Sequence ID No. 

35 2): 
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5 , m;tatctttggataaaagaggaccggagacgctctgc-3 • 
screening oligonculeotide (Sequence ID No. 3): 
5 • -ATAAAAGAGGACCGGA-3 • . 
Removal of the above noted sequences yielded a fusion 
5 gene containing DNA encoding the aMF pre-pro region and 
lys-arg processing site fused directly to the coding 
region of the IGF-1 gene. 

The selected clone, designated pIGF104, was 
sequenced to verify the changes and then subjected to a 
10 second site-directed mutagenesis to insert an EcoR I site 
immediately following the translation termination codon 
of the IGF-1 gene. Oligonucleotides having sequences as 
follows were used in this mutagenesis: 

mu tag en i zing oligonucleotide (Sequence ID No* 

15 4): 

5 1 -AGTCAGCTTGATAAGAATTCAAATGAGTCGACCTGCAGGC-3 • 
screening oligonculeotide (Sequence ID No. 5): 
5 1 -TAAGAATTCAAATGAGT-3 • . 
The mutagenized clone was designated pIGF105. 

20 After the second mutagenesis was confirmed by DNA 

sequencing, the aMF-IGF-1 gene fusion was isolated on a 
450 bp EsfiRI fragment, then 250 ng of this EcoR I fragment 
was inserted into 10 ng of the £j_ pastoris expression 
vector pA0815, which had been previously digested with 

25 EcoR I and treated with calf alkaline phosphatase. The 
resulting single-copy expression vector, pIGF201, 
contains one copy of the aMF-IGF-1 fusion gene under the 
transcriptional control of the Pichia pastoris A0X1 
promoter and regulatory regions, as well as the AOX1 

30 transcription termination and polyadenylation signals. 

In addition, the vector includes the Pichia pastoris HIS4 
gene used for selection in His' hosts and additional 3' 
AOXl sequences which can be used to direct integration of 
the vector into the host genome. Plasmid pIGF201 is 

35 sh wn in Figure 2. 
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The entire aMF-IGF-1 fusion gene and approximately 
50 nucleotides each of the promoter and termination 
regions of pIGF201 were sequenced to verify that the 
nucleotide sequences were not altered during the cloning 
5 process. 

2. construction of expression vectors pIGF202, 
PIGF204, and pIGF206, which contain a multiple copies of 
the expression cassette. 

The expression cassette containing the AOX1 promoter 

10 and regulatory region, the oMF-IGF-1 fusion gene, and the 
AQX1 transcription termination and polyadenylation 
signals was isolated from pIGF20l as a 1700 bp Bal ll- 
EsmHI fragment. The Bglll -BamH I expression cassette 
(250 ng) was inserted back into the unique BaiaH I site of 

15 pIGF20l (10 ng of BaffiHI digested, calf alkaline 
phosphatase-treated pIGF201) . MC1061 cells were 
transformed with the ligation. Amp* colonies were 
selected and plasmid was characterized by restriction 
digest. Analysis of restriction enzyme digests of the 

20 resulting plasmid, pIGF202 shown in Figure 3, verified 
that the two expression cassettes were joined as tandem- 
repeat units rather than inverted-repeat units: Sai l 
digest yielded ~2100, 1750, and 6100 bp bands; Clal/BamH I 
digest yielded ~3800 and 6450 bp bands. 

25 The Eglll-fiaitfKI fragment from plasmid pIGF202, 

containing two copies of the expression cassette, was 
isolated (250 ng) and inserted back into the unique fiafiHI 
site in calf alkaline phosphatase-treated pIGF2 02 (10 ng) 
to yield vector pIGF204 shown in Figure 4, containing 

30 four copies of the expression cassette. MC1061 cells 
were transformed with the ligation. Amp R colonies were 
selected and plasmid was characterized by restriction 
digest. Correct plasmid demonstrated "6600 and 6900 bp 
bands upon digestion with Cla l and BamH I. 

35 To construct expression vector pIGF206, which 

contains six copies of the expression cassette, the 
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Balll-fiamHI fragment from pIGF202 (250 ng) was cloned 

into the BamH I site of pIGF204 (10 ng; pIGF204 had been 

previously treated with calf alkaline phosphatase) . 

HC1061 cells were transformed with the ligation. Amp* 

5 colonies were selected and plasmid was characterized by 

restriction digest. Restriction digests of the vector DNA 

were examined to verify the number of expression 

cassettes and that the expression cassettes were joined 

as tandem-repeat units. Correct plasmid yielded bands of 

10 '6600 and 10300 bp upon digestion with Cla l and BamH I. 

Plasmid pIGF206 is shown in Figure 5. 

B« Construction of expression vectors containing 
geMF prepro-lys-arg-glu-ala-lGF-l . 

IGF-1 expression vectors containing DNA encoding the 

15 ocMF prepro sequence, including the lys-arg and one glu- 
ala processing sequence, fused to the IGF-1 coding 
sequence were prepared from pIGFlOS, which contains the 
otMF prepro-lys-arg-IGF-1 fusion gene in M13mpl9 (see 
Example 2.A.1). Site-directed mutagenesis of single- 

20 stranded DNA from pIGF105 was performed to insert codons 
for a glu-ala spacer site between the otMF prepro-lys-arg 
and IGF-1 DNA sequences. Mutagenesis was accomplished 
using standard procedures and oligonucleotides having the 
following sequences: 

25 Mutagenising oligonucleotide (Sequence ID No. 6) 

5 • -TCTTTGGATAAAGAGAGGCTGGACCGCAGACGCTC- 3 • 
Screening oligonucleotide (Sequence ID No. 7) 
5 » -AAAAGAGAGGCTGGACCGC-3 ■ . 
The selected clone was sequenced to verify that the DNA 

30 encoding the glu-ala residues was added correctly. The 
otMF prepro-lys-arg-glu-ala-IGF-1 gene fusion was isolated 
from the selected clone on a 465-bp EcoR I fragment and 
ligated to EcoR I -digested f alkaline phosphatase-treated 
vector pA0815, The resulting single copy expression 

35 vector, pIGF816, contains on c py of the otMF prepro-lys- 
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arg-glu-ala-IGF-1 gene fusion under transcriptional 
control of the £. pastoris AOX1 promoter and regulatory 
regions / as well as the AQXl transcription termination 
and polyadenylation signals. The vector also includes 
5 the £♦ pastoris HIS4 gene for selection in His* hosts and 
additional AQXl sequences. The entire aMF prepro-lys- 
arg-glu-ala-IGF-l gene fusion and approximately 30 
nucleotides each of the promoter and termination regions 
of pIGF816 were sequenced to verify that the sequences 

10 were not altered during the cloning process. 

Expression vectors containing two or more copies of 
the aMF prepro-lys-arg-glu-ala-IGF-1 gene cassette can be 
prepared as described for construction of vectors 
containing multiple copies of the aMF prepro-lys-arg-lGF- 

15 l gene expression cassette (see Example 2. A, 2.) . 

EXAMPLE 3 

INTRODUCTION OP THE EXPRESSION CASSETTES FROM THE IGF-1 
EXPRESSION VECTORS INTO STRAINS OF PI CHI A PASTORIS . 

A. Mut* strains 

20 Expression vectors pIGF201 f pIGF202, pIGF204, and 

pIGF20€ and pIGF816 were used to develop IGF-l-expressing 
Mut* strains of Fichia pastoris. The Mut phenotype refers 
to the methanol utilization ability of the strain. Mut* 
strains consume methanol at a rate similar to that of 

25 wild-type strains. A HIS4 mutant of Pichia pastoris, 
GS115 (ATCC No. 20864), was used as the host for all 
transformations, which were accomplished by either the 
spheroplast method (performed as described in U.S. Patent 
No. 4,879,231) or by the whole-cell lithium chloride 

30 yeast transformation system (Ito. et al. (1984) Aoric . 
£i£l* Chem. £&: 341) with modification necessary for 
adaptation to methylotrophic yeast, such as £. pastoris 
(see European Patent Application No. 312,934; also 
available as U.S. Patent No. 4,929,535). 

35 Mut* strains were generated by integration of the 
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entire expression vector into either the AOX1 or HIS4 
locus of the host genome by an additive homologous 
recombination event. Plasmid pIGF201 was transformed 
into GS115 as an undigested circular vector and allowed 
5 to integrate randomly into the AOXl locus at either the 
5' or 3* regions homologous to sequences contained in the 
plasmid, or into the HIS4 locus. For site-directed 
addition to the HIS4 locus, the multi-copy expression 
vectors pIGF202, pIGF204, and pIGF206 and the single copy 

10 vector pIGF816 were digested with Stu I, which linearizes 
the plasmids by cutting the vector within the HIS4 
region. Additive integration at either the AOXl or HIS4 
locus does not disturb the AOXl gene. 

The Mut + transformants resulting from additive 

15 integration of the expression plasmids, either randomly 
(PIGF201) or by site-direction (pIGF202, pIGF204, pIGF206 
and pIGF816) , were initially screened for histidine 
prototrophy. Prototrophic strains generated by separate 
transformation of GS115 with each of the four plasmids 

20 were selected for further analysis. 

Ten of the His* transformants resulting from 
transformation of GS115 with pIGF201 were analyzed by 
Southern blot hybridization to verify the site of 
integration of the expression plasmid and number of 

25 copies of the plasmid integrated. Chromosomal DNA from 
the 10 transformants was separately digested with EcoR I 
and Bal ll r separated by agarose gel electrophoresis, and 
transferred to nitrocellulose. The EcoR I digests were 
probed with pBR322-based plasmids containing either the 

30 AOXl 5 f and 3 1 regions or the Pichia pastor is HIS4 gene. 
The Bcrl ll digest was probed with an oligonucleotide 
homologous to the IGF-1 gene. 

Sixteen of the His* transformants generated by 
transformati n of GS115 with pIGF202, 16 generated by 

35 transformation with pIGF204, and 26 generated by 
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transformati n with pIGF206 were also analyzed by 
Southern blot hybridization to verify the site of 
integration and integrity of the multi-copy expression 
vector. Chromosomal DNA was digested with Bql ll and 
5 probed with plasmids containing the AQXl 5* and 3* 

regions or HTS4 gene, and also separately digested with 
SfcSil and probed with the oligonucleotide homologous to 
the IGF-1 coding sequence. 

Analysis of DNA from ten pIGF201 transformants by 

10 Southern blot hybridization indicated that four 

transformants contained a single copy of the expression 
vector integrated at the. &22£1 locus and two transformants 
contained multiple copies of the plasmid integrated at 
the HIS4 locus* The four other strains contained the 

15 plasmid integrated at unknown loci. It was not possible 
to determine the exact number of copies of the plasmid 
integrated at HI§4. Southern analysis of DNA revealed 
that 10 of the pIGF202 transformants, 9 of the pIGF204 
transformants, and 13 of the 26 piGF206 transformants 

20 contained a single copy of the respective expression 
vector integrated at HIS4- The other transformants 
contained plasmids integrated at unknown loci. 

The following representative strains generated by 
transformation of GS115 with vectors pIGF201 f p!GF202, 

25 pIGF204 or pIGF206 were selected for further analysis: 



35 



Strain Name 


Plasmid 
Integrated 


Site of 
Integration 


Number of 
Plasmids 
Integrated 


Expression 

Cassette 
Copy Number 


G+IGF201S1 


PIGF201 


AOXi 


1 


1 


G+IGF206S2 


PIGF206 


HIS4 


1 


6 


G+IMB202S2 


PIGF202 


PJS4 


1 


2 


G+IMB204S14 


PIGF204 


SIM 


1 


4 


G+IMB206S1 

In addit; 


PIGF206 
Lon, tour str 


HIS1 

ams oanerate 


1 

ifi hv 


6 
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tr ems formation with pIGF8l6, strains G+IGF816S1, 
G+IGF816S2, G+IGF816S9 and G+IGF816S11, were also 
analyzed further during growth in one-liter 
fermentations. 
5 B. Mut* 8 trains 

In Mut" strains, the expression vector integrates 
into the AOX1 locus and disrupts it. Thus, Mut" strains 
consume methanol at a much slower rate than Hut 4, strains* 
To generate Muf strains, plasmid pIGF206 was 

10 digested with Bglll. This liberates a fragment comprised 
of the six IGF-1 expression cassettes, the HIS4 gene for 
selection, and the AOX1 3' region. Both ends of this 
fragment contain long sequences which are homologous to 
the 5» and 3 1 ends of the AOX1 locus. Upon 

15 transformation of GS115 hosts with the gglll-ended 

fragment, integration into the AOXl locus by homologous 
recombination results in substitution of the Ealll-ended 
fragment for the AOXl structural gene. Positive 
transf ormants were selected by their His* phenotype and 

20 by their Mut" phenotype, slow growth on methanol. 

Selection is accomplished by plating His* transformants on 
minimal glucose (2%) master plates to obtain colonies 
originating from single cells. After overnight 
incubation at 30 °C, the masters were replica-plated to 

25 minimal glucose plates and plates containing no carbon 
source to which methanol was added in vapor phase. This 
is accomplished by adding approximately 200 fil of 
methanol to the underside of the top of a covered petri 
dish. The plates were incubated at 30 °C for 4-6 days 

30 with additional methanol added in the vapor phase every 
two days. Colonies showing visible growth were scored as 
Mut 4 and those with no visible, or slow, growth were 
scored as Mut". Approximately 15% were slow growers, 
which is indicative of disruption of the AOXl gene. 

35 Following the initial screening for methanol 



WO 92/04363 PCT/US91/06452 



-44- 



utilization and histidine prototr phy, chromosomal ONA 
from three Hut:" transformants was analyzed by three 
different Southern blots to verify the site of plasmid 
integration and number of copies integrated. 
5 Chromosomal DNA from Hut" transformants was digested 

with EcoR I and probed with pBR322-based plasmids 
containing either the AOXl 5* and 3 f regions or the 
Pichia pastor is HIS4 gene. An additional blot of Bglll- 
digested DNA was probed with the plasmid containing the 

10 IGF-l gene. Southern analysis of the three Hut" 

transformants generated using plasmid pIGF206 revealed 
that all three contained a single copy of the Bal ll 
fragment, containing six expression cassettes, integrated 
by replacement at the AOXl locus. 

15 The strains were characterized as follows: 

Number of Expression 

Site of Plasmids cassette 

Strain Name Integration Integrated copy number 
G-IMB206S1 AOXl 1 6 
20 G-IHB206S2 AOXl 1 6 
G-IHB206S3 AOXl 1 6 

C Preparation of protease-d ef icient strains for 
expression and secretion of IGF-l, 

25 The use of protease -deficient strains of P. 

pastoris hosts for recombinant expression of heterologous 
proteins susceptible to degradation by pastoris 
proteases is described in U.S. Patent Application No. 
07/678,916. Protease-def icient P. pastoris strains have 

30 been generated by disrupting £^ pastoris genes, -such as 
PEP4 and PRB-1 , that encode proteins that directly or 
indirectly affect protease activity of the cell. 
Disruption of these genes results in a reduction of at 
least a portion of the protease activities in the cell, 

35 including proteinase A, carboxypeptidase Y and proteinase 
B activities. Recombinant Pep4" (also called pep4 1 and 
Pep4'Prb-l* (also called pen4 prb-1 IGF-l-secreting 



a 



WO 92/04363 PCT/US9 1/06452 

-45- 

strains of £^ pastoris have been prepared. 

Four protease-def icient IGF-1 expressing strains of 
P. Pastorls , strains M+IMB206S1, M+IGF816S1, M+IGF816S4 
and C+IGF816S1 were generated using two different 
5 approaches. Strain M+IMB206S1 was developed by 

transformation of a L, pastoris strain containing six 
copies of an aMF prepro-lys-arg-IGF-1 gene expression 
cassette (strain G+IMB206S1, see Example 3. A. ) with a 
PEP4 gene disruption vector, pDR421. Vector pDR421 

10 (described below and in detail in U.S. Patent Application 
Serial No. 07/678,916) contains an internal portion of 
the pastoris PEP4 gene which integrates into the host 
genome at the ££Ei locus to generate two incomplete and 
nonfunctional copies of the PEP4 gene. 

15 Strains M+IGF816 SI,. M+IGF816S4 and C+IGF816S1 were 

generated by transforming protease deficient £^ pastoris 
host strains with the aMF prepro-lys-arg-glu-ala-IGF-l 
gene construct. Strains M+IGF816S1 and M+IGF816S4 were 
generated by transformation of a pep 4 host and C+IGF816S1 

20 by transformation of a pep4 , prb-l host. 

l. Construction of the £x pastoris PEP 4 gene 
disruption vector pDR421. 

Plasmid pDR421 was constructed for use in the 

preparation of PEP4 -def icient (Pep4~) strains of 

25 pastoris in which the host PEP 4 gene is disrupted by 
addition of an incomplete PEP 4 gene to the endogenous 
PEP 4 locus. Plasmid pDR421 contains an internal portion 
of the PEP 4 gene, which, when introduced into the host 
genome at the PEP 4 locus, results in two incomplete and 

30 nonfunctional copies of the PEP 4 gene. 

To construct pDR421, the URA 3 gene of pastoris 
was cloned into vector pEP205. Plasmid p£P205 contains 
pUC19 sequences and the portion of the PEP 4 gene encoded 
by the approximately 450 bp fiamHl fragment derived fr m 



WO 92/04363 



PCI7US91/06452 



PEP202. Plasmid pEP202 was prepared by inserting a 10; 6 
kb £coRI fragment of the K pastoris genome r that had 
been identified as containing the PEP 4 gene by- Southern 
blot hybridization of_£coRI -digested £^ pastoris genomic 
5 DNA with the homologous S. cerevisiae PEP4 gene as a 

probe, into pUC19. The 10.6 kb fragment was isolated and 
approximately 200 ng of it was ligated with an equal 
amount of pUC19 which had been cut with EcoR I and 
dephosphorylated. The ligation mixture was used to 

10 transform^, coli strain MC1061. Ampicillin-resistant 
colonies were selected and screened by analysis of 
restriction enzyme digests of colony DNA for the presence 
of the diagnostic 10.6 kb EcoR I fragment. A large-scale 
plasmid preparation was made from a colony containing the 

15 correct plasmid, which was named pEP2 02. Plasmid pEP202 
contains the complete pastoris PEP4 . 

The URA 3 gene was isolated from Spe l/ SphI digested 
PPU205 as a 2 kb fragment and ligated to Xbal / Sph l- 
digested pEP205 and used to transform coli strain 

20 MC1061 to ampicillin resistance. Amp* colonies were 
screened by analysis of BamH I/ Sph I digested colony DNA 
for the presence of 2.7kb, 0.4 kb and 1.9 kb fragments. 
A selected transformant that contained the correct 
plasmid construct was identified and the plasmid was 

25 designated pDR421. 

2. Preparation of K+XHB206S1 
M+IMB206S1, a pep4 IGF-1 expressing strain, was 
generated by transforming a Pen4 £^ Pastoris strain that- 
contained six copies of an aMF prepro-lys-arg-IGF-1 

30 construct with the PEP4 disruption vector pDR421, which 
contains an internal portion of the £^ pastoris PEP4 gene 
and which integrates into the host genome at the PEP4 
locus, thereby disrupting expression of the gene by 
generating two incomplete and nonfunctional copies of the 

35 gene. 
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In order to facilitate identification of 

transformants that incorporated pDR421, an auxotrophic 

marker selection system was established in strain 

G+IMB206S1 prior to transformation. This was 

5 accomplished by isolation of a colony of G+XMB206S1 that, 

through spontaneous mutation, became auxotrophic for 

uracil (Ura-) . Because pDR421 contains a £^ pas tor is 

URA3 marker gene, in addition to the PEP4 fragment, it 

was possible to initially select those cells of the Ura* 

10 derivative of G+IMB206S1 that had incorporated pDR421 on 

the basis of uracil prototrophy. The isolation of a Ura" 

IGF-l-expressing pastoris strain and its use in 

generating a PEP4 IGF-l-expressing pastoris strain 

are described in this section. 

15 a. Isolation of the Ura' IGF-1 expressing 

strain IGF-U. 

Resistance to a 5-f luro-orotic acid (5-FOA) was used 
as a means of identifying Ura" derivatives of strain 
G+IMB206S1. 5-FOA is an analog of a uracil biosynthetic 

20 pathway intermediate that, when metabolized by Ura 4 

strains, yields a toxic compound. Therefore, Ura* strains 
cannot survive on 5-FOA-containing medium. In contrast, 
the uracil biosynthetic pathway of Ura" strains is blocked 
at certain steps, and such strains do not metabolize 5- 

25 FOA and are unaffected by its toxic effects. 

Consequently, plating cells on 5-FOA-containing medium 
can be used as a method to detect Ura" strains generated 
by spontaneous mutation. 

A Ura" derivative of the IGF-l-producing strain 

30 G+IMB206S1 is isolated by direct plating of 

approximately 5 X 10 7 cells into a 5-FOA-containing medium 
supplemented with uracil (0.67% yeast nitrogen base, 2% 
agarose, 2% glucose, 750 mg/L of 5-FOA and 45 mg/L of 
uracil). After one week of incubation at 30*C, a colony 

35 growing on the plate was isolated. This colony, which 
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required uracil to grow, was unable to c mplement a ura3 
strain of Pichia pastoris and was designated IGF-U, 

b. Transformation of IGF-U with pDR42l 
Plasmid pDR421 contains the Pichia URA3 gene and a 
5 small portion of the Pichia PEP4 gene. Balll-digested 
plasmid pDR421 was used to transform IGF-U using standard 
spheroplast transformation procedures. Digestion of 
PDR421 with Bglll linearized the vector by cleaving it 
within the PEP4 gene fragment. Thus, the resulting 

10 linear DNA had 5" and 3» ends containing sequences from 
the PEP4 gene and a BB&2. gene. Because the ends of this 
Bglll fragment are homologous to adjacent sequences of 
the PEP4. gene, integration of the fragment in the genome 
of IGF-U was directed at the PEP4 locus via homologous 

15 recombination that resulted in disruption of the PEP4 
locus. The URA3 gene of the integrated DNA fragment 
conferred a Ura* phenotype on the transf ormants that 
provided a marker for strains, which were transformed. 
Ura* transf ormants were subsequently analyzed for 

20 carboxypeptidase Y (CPY) activity using a colony overlay 
calorimetric screening procedure (Jones, E. (1977) 
Genetics 85: 23-33). Colonies which appeared to have low 
carboxypeptidase Y (CPY) activity based on the results of 
this assay were isolated, subcultiired, and then 

25 rescreened using the overlay assay. One of these strains 
was designated M+IMB206S1. 

3. Preparation of M+IGF816S1 and M+IGF816S4 
Protease-deficient strains containing one or more 
copies of the aMF-prepro-lys-arg-glu-ala-IGF-1 gene 

30 expression cassette were generated by transformation of a 
Pep4" His4 - strain of P. pastoris . designated MGP21, with 
PIGF816. 

a« Generation of MGP21 

The protease-deficient host strain KGP21 was 
35 generated by gene replacement methods, as described in 
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U.S. Patent Application Serial No. 678,916, in which the 
PEP4 gene is replaced with a defective pep4 gene. 
Briefly, a His4* Ura' P. pastor is strain (GS4-2), selected 
from cells of a His4" strain (GS115) that had been plated 
5 onto 5-FOA, was transformed with a linear fragment of a 
PEP4 disruption vector designated pDR602 (see U.S. Patent 
Application Serial No. 07/678,916). The linear fragment 
of pDR602 consisted of the P. pastoris gene f lanked 

on each side with DNA coding for a portion of the £. 

10 pastoris PEP4 gene. Homology between the ends of the 

fragment and the endogenous PEP4 gene of GS4-2 stimulated 
integration of the fragment at the PEP 4 locus resulting 
in a gene replacement event in which the defective URA3 - 
containing pet>4 gene replaced the endogenous PEP4 gene. 

15 The resulting strain, MGP21, is His4" Ura* Pep4*. 

b. Transformation of MGP21 
Strain MGP21 was transformed with StuI -digested 
PIGF816 via the lithium chloride method. Because StuI 
cleaves pIGF816 in the HIS4 gene, the linear fragment was 

20 directed to integrate into the his4 locus of strain 
MGP21. Resulting transf ormants were screened for 
histidine prototrophy. Two of the strains were 
designated M+IGF816S1 and M+IGF816S4. 

4. Preparation of strain C+XGF81681 

25 A pep4 prb-1 IGF-l-expressing strain of £. pastoris 

containing one copy of the aMF prepro-lys-arg-glu-ala- 
IGF-1 expression cassette was generated by transformation 
of a peo4 orb-1 ura 3 his4 £. pastoris host strain, 
designated MG18, with pIGF816. 

30 a. Generation of MG18 

Strain MG18 was generated by disruption of the PRB-1 
gene of the pep4 ura 3 his4 £. pastoris strain GS4-2521-4- 
5 (see U.S. Patent Application Serial No. 07/678,916 for 
a detailed description of the generation of GS4-252 1-4-5 

35 from the ura 3 his4 strain GS4-2 via the intermediate 
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strain GS4-2521-4). This was accomplished by 
transformation of GS4-2521-4-5 with the PRB-l disruption 
vector pDR911. Vector pDR911 (described in U.S. Patent 
Application Serial No. 678,916) contains an internal 
5 portion of the £. pastoris PRB-l gene, which, when used 
to transform PRB-l strains of £. pastoris . integrates 
into the host genome at the PRB-l locus to generate two 
incomplete and non-functional copies of the PRB-l gene. 
Vector pDR9ll also contains a complete functional £. 

10 pastoris URA3 gene for use as a selectable marker in ura3 
host strains of £. pastoris f such as GS4-2521-4-5. 

Plasmid pDR911 was linearized by cleavage with Bal ll 
which cuts the plasmid within the fragment of the PRB-l 
gene. The linear DNA was used to transform GS4-2521-4-5 

15 according to the sphef oplast method. Because the ends of 
this linear DNA are homologous to adjacent sequences of 
the PRB-l gene, integration of the DNA into the genome of 
GS4-2521-4-5 was directed at th e PRB-l locus via a 
homologous recombination event resulting in the 

20 disruption of the PRB-l locus. The URA3 gene of the 

integrated DNA fragment conferred a Ura* phenotype on the 
trans formants that provided a marker for strains that had 
been transformed. Genomic DNA from transformants that 
were able to survive on media lacking uracil was analyzed 

25 by Southern blot hybridization. The DNA was digested 
with £coRV, seperated by electrophoresis on a 0.75% 
agarose gel and transferred to nitrocellulose. The 
filter was probed with random-primed pDR911. The band 
pattern expected for a strain in which the PRB-l gene had 

30 been disrupted was a loss of the 5 kb fragment 

representing the endogenous undisrupted PRB-l gene and 
the appearance of an approximately 10 kb fragment 
representing the endogenous 5 kb PRB-l fragement plus the 
5 kb linear vector DNA pDR911. DNA from 3 out of 40 

35 transformants exhibited this band pattern. One of these 
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strains was MG18. 

b. Transformation of XQ18 with pIGF8l6 
To develop pep4 prb-1 strains of 2* 
pastoris , strain MG18 was transformed with £tul -digested 
5 pIGF816 by the lithium chloride method. Because Stu I 
cleaves pIGF8l6 in the HIS4 gene, the linear fragment was 
directed to integrate into the his4 locus of strain MG18. 
Resulting transf ormants were screened for histidine 
prototrophy. One strain was designated C+IGF816S1. 
10 EXAMPLE 4 

GROWTH OF STRAINS IN ONE- AND TEN-LITER FERMENTATIONS . 

Medium employed in fermentations described herein 
had the following composition: 

A. 10X BASAL SALTS 

15 Chemical Grams/liter 

Phosphoric acid, 85% 42.0 ml 

Calcium Sulfate. 2H20 1.8 

Potassium Sulfate .28*6 

Magnesium Sulfate.7H20 23.4 

20 Potassium Hydroxide 6.5 

B. PTM 1 TRACE SALTS 
Chemical Grams/ liter 
Cupric Sulfate -5^0 6.0 
Sodium Iodide 0.08 

25 Manganese Sulfate *H 2 0 3.0 

Sodium Molybdate • 2H 2 0 0.2 

Boric Acid 0.02 

Cobalt Chloride 0.5 

Zinc Chloride 20.0 

30 Ferrous Sulfate •THjO 65.0 

Biotin 0.20 

Sulfuric Acid 5.0 ml 
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A. One-Liter Fermentations 

In order to maximize expression and secretion of 
IGF-l, the effects of glycerol concentration and pH on 
IGF-l production in one-liter fermentations have been 
5 investigated. 

1. Hut* protocol methanol fed-batch 
fermentation. 

The Mut* fermentation protocol included three 

separate phases: (1) cells are initially are cultured on 

10 glycerol in a batch mode; (2) following exhaustion of the 
glycerol, a limited glycerol feed is initiated such that 
glycerol does not accumulate, the AOX1 promoter is 
derepressed and cell mass increases: and (3) a methanol 
feed is initiated for production of heterologous protein 

15 in a methanol fed-batch mode. 

Prior to use the f ermentor was autoclaved with about 
one liter of medium containing 500 ml of lOx basal salts 
medium (final basal salts concentration of 5X, see above) 
and 30-50 g glycerol. After sterilization, 4 ml 

20 trace salts (see above) were added to the fermentor and 
the pH was adjusted to 5 with concentrated NH^OH. In runs 
in which the pH was maintained at 5, pH was adjusted by 
addition of 50% NH^OH containing 0.1% Struktol J673 
antifoam. Dissolved oxygen was maintained above 20% of 

25 saturation by regulating agitation and aeration, or 
supplementing the air feed with oxygen. In low pH 
fermentations in which the pH was maintained between 2.8 
and 3.5, the pH controller was adjusted to the desired pH 
either at the initiation of the limited glycerol feed or 

30 at the initiation of the methanol feed and then allowed 
to decrease by virtue of cellular metabolism to the 
desired pH. 

Inocula were prepared from buffered YNB glycerol 
plates and grown overnight at 30 C C in phosphate-buffered 
35 YNB (11.5 g/L KH^Pq,, 2.66 g/L KjHPO,, 0.67% yeast nitrogen 
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base, pH 6)) containing 2% glycerol. The fermentor was 
inoculated with these cultured cells, which had grown to 
an ODgjQ of 1-6, and the batch growth regimen was 
continued for 18 to 24 hours. At the point of glycerol 
5 exhaustion, indicated by increased dissolved oxygen / a 
glycerol feed (50% glycerol plus 12 ml/1 of PTMj) was 
initiated at a rate in the range of about 5-20 ml/h. 
This period of growth on limited glycerol was continued 
until a total volume of approximately 40-300 ml of feed 

10 was added to the fermentor. After addition of the 

glycerol feed, a methanol feed (100% methanol plus 12 
ml/L PTM,) was started. The initial rate of methanol 
addition was approximately 1-2 ml/h. The methanol feed 
rate was increased for 3 to 8 hours to 5-6 ml/h. In some 

15 fermentations the feed rate was directly increased to a 

rate of 6 ml/h after three hours of methanol feeding. 

The vessel was harvested 40-95 hours following initiation 

of methanol induction. 

2* Mut" protocol methanol fed-batch 
20 fermentation (Run 657) • 

The first two phases of the methanol fed-batch 

fermentations of the Mut* strains were conducted as 

described for the Mut 4 strain fermentations in Example 

4(A)(1). However, the methanol induction phases of the 

25 Mut* and Mut" fermentation protocols differed in terms of 
the manner in which the methanol feeds were added to the 
cultures. In standard fermentations of the Mut" strain, 
the methanol feed rate was adjusted to maintain an excess 
of methanol in the medium which did not exceed 0.3% (as 

30 determined by gas chromatography) . The methanol feed was 
initiated at 1 ml/hr and after two hours was increased in 
10% increments every 30 minutes to a rate of 3 ml/hr 
which was maintained for the duration of the 
fermentation. The vessel was then harvested after 101 

35 hours of growth of the strain on methanol. 
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B. Tan-Liter Fermentations 
1. Fed -bat oh 

a. Standard methanol feed rate (Runs 843, 

959, 960, 962, 968 and 999). ' 

5 A 15-liter fermentor containing 3.5 liters 10X basal 

salts and 220 g glycerol brought to a total volume of 5.5 
liters by the addition of water was sterilized. After 
the fermentor had cooled, 24 ml PTM, trace salts were 
added and the pH was adjusted to 5.0 with the addition of 
10 28% ammonium hydroxide. The pH of the fermentation was 
maintained by addition of the same solution and foaming 
was controlled with the addition of a 5% solution of 
StruJctol J673. Temperature was maintained at 30»C, and 
dissolved oxygen. was maintained above 20% of saturation 
15 by regulating agitation, aeration, and reactor pressure 
or by supplementing the air feed with oxygen. 

Inocula were prepared from cells grown overnight in 
buffered YNB (11.5 g/L KI^P04, 2.66 g/1 KjHPO^, 6.7 g/L 
yeast nitrogen base, pH 6) containing 2% glycerol. The 
20 fermentor was inoculated with 500-700 ml of the cultured 
cells which had grown to a concentration of 2-8 OD M0 , and 
the batch growth regimen was continued for 18-24 hours 
until glycerol was exhausted. At the point of glycerol 
exhaustion, indicated by an increase in dissolved oxygen, 
a glycerol feed (50% w/v glycerol plus 12 ml/L PTM,) was 
initiated at 100 ml/hr. After four hours, the glycerol 
feed was terminated and a methanol feed (100% methanol 
plus 12 ml/L PTM,) was initiated at a rate of 20 ml/hr. 
After 304 hours of methanol feeding, the methanol feed 
30 rate was increased to 60 ml/hr and maintained at this 
rate for the remainder of the fermentation. The vessel 
was harvested approximately 72 hours after the initiation 
of the methanol feed. 

In pH 5 fermentations, the pH of the culture wa6 
maintained at 5 throughout the fermentation, in low pH 



25 



35 
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fermentations ( i.e. , pH 2.8-3.0), the set point of the pH 

controller was adjusted to the target pH and the pH of 

the culture was allowed to decrease to the new set point 

as a result of cellular metabolism. In some 

5 fermentations, the 6et point of the pH controller was 

adjusted at the initiation of the glycerol feed, whereas 

in other fermentations, the pH set point was not lowered 

until initiation of the methanol feed, 

b. Modified methanol feed rates (Runs 
10 789, 810 and 906) • 

Additional 10-liter fermentations were conducted 

essentially as described in Example 4.B.l.a. except with 

modified methanol feed rates. In Run 789, the methanol 

feed rate was steadily increased as the cell density 

15 increased during the fermentation. After the initial 

three hours of methanol feeding at 20 ml/hr, the feed 

rate was increased to 30 ml/hr. The feed rate was then 

increased twice daily in proportion to the increase in 

cell mass. 

20 In Run 810, the methanol feed rate was increased to 

60 ml/hr after an initial four hours at 20 ml/hr. After 
21 hours of methanol feeding, the feed rate was then 
increased to 100 ml/hr and maintained at the rate for the 
remainder of the fermentation (51 hours) . 

25 In Run 906, a methanol feed rate of 40 ml/hr was 

maintained for the first 60 hours of feeding, and then 
increased to 60 ml/hr for the remainder of the 
fermentation (90 hours total) . 

2. Continuous culture (Run 929). 

30 In Run 929, the fermentation process was switched 

from a fed-batch to a continuous mode after 20 hours of 
methanol feeding. This was accomplished by initiating a 
3X basal salts feed along with the methanol feed and 
maintaining a constant volume in the f ermentor by 

35 c ntinuous removal of whole broth (including cells). The 
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methanol feed was maintained at a constant rate f 70 
ml/hr and the basal salts feed rate was set at 70% of the 
total feed (i.e., methanol plus basal salts) rate. Thus, 
a total feed of 230 ml/hr, containing 30% methanol, was 
5 fed into the ferment or with a constant total volume of 8 
liters. /Continuous culture was carried out for 139 hours 
(total of 159 hours on methanol) . 
C. Results 

Samples (15-ml aliquots) of the ferment or culture 

10 were removed from the f ermentor at various times 

throughout the course of the fermentation. Aliquots of 
each sample were centrifuged at 6500 x g for 5 min to 
separate broth and cells. The levels of the NH 4 OH, 
ant i foam, glycerol and methanol reservoirs were recorded 

15 at these time points. Methanol and ethanol 

concentrations in the supernatant were determined by gas 
chromatography using a PorapakQ column (Alltech, 
Deerf ield, IL) . 

In addition, the wet weight of the culture was 

20 determined as an indicator of cell growth in the 

f ermentor. For this purpose a one-ml aliquot of the 
fermentor culture was centrifuged for four minutes in a 
microfuge, the supernatant was decanted, and the wet 
pellet was weighed. 

25 The IGF-1 levels of the cell-free broth were 

determined by RIA, quantitative immunoblot or reverse* 
phase HPLC (see Example 5 for a description of each of 
these methods) . The results of fermentations of the IGF- 
1-expressing strains of £^ pastorls are provided in the 

30 Tables I-V. 

TABLE I 

Results of Quantitative Immunoblot Analyses 
of Cell-Free Broth 
from One-Liter Fermentations 
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RUN 


STRAIN 
NAME 


EXPRESSION 
CASSETTE 

COPY 
NUMBER 


P H 


GLYCEROL 
ADDED 

(o) 


CELL DENSITY 


HOURS 

ON 
MeOH 


IGF-1 
(mg/U 


WET WEIGHT 
GLYCEROL 


(0/U 
MeOH 


573 


G+IGF201S1 


1 


5 


100 


224 


308 


49 


11 


576 


G+IGF206S2 


6 


5 


100 


198 


280 


49 


117 


578 


G+IGF206S2 


6 


3.5* 


160 


328 


360 


101 


284 


596 


G+IGF206S2 


6 




150 


306 


385 


43 


655 


598 


G+IGF206S2 


6 


3.5> 


150 


298 


363 


64 


455 


605 


G+K3F201S1 


1 


3* 


ISO 


306 


415 


64 


21 


L" 


G+IGF206S2 


6 


3* 


150 


304 


344 


70 


489 



The pH was decreased from 5 to 3.5 after 26 hour* of growth on methanol. 

The pH was decreased from 5 to the Indicated pH at the initiation of the glycerol fed-batch phase. 

TABLE XI 

Results of IncStar Antisera-Based RIAs of 
Cell-Free Broth from One-Liter Fermentations 



RUN 


STRAIN 
NAME 


EXPRESSION 
CASSETTE 

COPY 
NUMBER 


PH 


GLYCEROL 
ADDEO 
(8) 


CELL DENSITY 


HOURS 

ON 
MeOH 


IGF-1 
(mg/U 


WET WEIGHT 
GLYCEROL 


(0/U 
MeOH 


596 


G+IGF206S2 


6 


3 


150 


306 


385 


48 


1860 


605 


G+IGF201S1 


1 


3 


150 


306 


415 


64 


167 


657 


G-IMB206S3 


6 


3 


150 


278 


370 


101 


746 


661 


G + IMB206S1 


6 


3 


150 


267 


320 


73 


1280 


680 


G+IM8202S2 


2 


3 


150 


270 


430 


73 


740 


691 


G + IMB204S14 


4 


3 


150 


265 


350 


92 


1400 


694 


G+IM8206S1 


6 


5 


150 


264 


32S 


94 


306 
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TABLE III 

Results of Nichols RIAs of Cell-Free 
Broth from One-Liter Fermentations 



RUN 


==== 

STRAIN 
NAME 


EXPRESSION 
CASSETTE 
COPY 
NUMBER 


PH 


GLYCEROL 
Anncn 

<0) 


CELL DENSITY 


HOURS 

ON 
MeOH 


| ,1 

IGF-1 
(mo/U 


WET WEIGHT 
GLYCEROL 


<g/U 

Me OH 


661 


G+IMB206S1 


6 


3 


150 


267 


320 


73 


140 


680 


G+IMB202S2 


2 


3 


150 


270 


430 


73 


64 


691 


G+IN®204S14 


4 


3 


" 160 


265 


350 


92 


174 


694 


G+IAffi206S1 


6 


6 


150 


254 


325 


94 


15 


755 


G+IMB204S14 


4 


2.8 


60 


160 


460 


72 


300 


756 


G+IM8204S14 


4 


2.8 


175 


314 


445 


72 


315 


757 


G+IMB204S14 


4 


2.8 


80 


201 


491 


72 


300 


772 


G+IMB206S1 


6 


2.8 


85 


181 


401 


71 


335 


773 


G+IMB204S14 


4 


2.6 


90 


200 


377 


70 


250 


775 


G+IM8204S14 


4 


2.8 


40 


104 


409 


70 


280 


776 


G + IM8206S1 


6 


2.8 


45 


116 


490 


71 


355 
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TABLE XV 

Results of HPLC Analyses of Cell-Free 
Broth from One-Liter Fermentations* 



RUN 


STRAIN 
NAME 


EXPRESSION 
CASSETTE 

COPY 
NUMBER 


PH 


GLYCEROL 
ADDED 
(8) 


CELL DENSITY 


HOURS 

UN 

Ma OH 


K5M b 
<mg/U 


WET 
WEIGHT 
GLYCEROL 


(g/u 

MeOH 


1 605 


G + IGF201S1 


1 




1 en 


306 


415 


64 


14 


1 680 


G + IMB902S2 


e 
* 




«rn 
• 9U 


270 


430 


73 


39 


I 691 


G+IMB204S14 


4 


3 


150 


265 


350 


92 


101 


694 


G + IMB206S1 


6 


5 


150 


254 


325 


94 


3 


767 


G+IMB204S14 


4 


2.8 


80 


201 


491 


72 


74 


772 


G + IMB206S1 


6 


2.8 


85 


181 


401 


71 


121 


834 


M+IMB20BS1 


8 


2.8 


60 


132 


450 


72 


139 


935 


M+IMB206S1 


6 


5,0 


50 


123 


363 


72 


167 


943 


G+IMB204S14 


4 


5.0 


SO 


125 


330 


72 


0 


1039 


G+IGF816S1 


1 e 


3.0 


50 


158 


320 


49 


16 


1047 


M+IGF816S1 


r 


3.0 


50 


123 


282 


69 


16 


1048 


G + IGF816S2 


i« 


3.0 


50 


134 


447 


*3 


17 


1049 


G+K3F816S9 


mufti 8 


3.0 


50 


131 


380 


69 


44 


1050 


G+IGF816S11 


mutti 0 


3.0 


50 


131 


386 


94 


9 


1053 


M+IGF816S4 


muW 


3.0 


50 


143 


393 


70 


33 


1055 


C+IGF816S1 


r 


3.0 


50 


143 


312 


70 


20 


1066 


C + IGF816S1 


V 


5 


50 


123 


234 


70 


20 


1067 


M+IGF816S4 


multr* 


5 


50 


132 


331 


70 


so 


1069 


G+IGF816S11 


muttr 3 


5 


50 


132 


323 


70 


0 



Cell-free broth was patted through • cotton exchange chromatography column prior to analysis by 
HPLC (see Example 4CJ. 

Concentration of authentic IGM Q.e.< intact, correctly folded monomark IGF-1). 

The number of expression cassettes contained In these strains is estimated based on IGF-1 

expression levels. 
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TABLE V 

Ten-Liter Fermentations 





STRAIN 
NAME 


EXPRESSION 
CASSETTE 
COPY 


PH 


GLYCEROL 
ADOS) 
(0) 


CELL DENSITY 


HOURS 

ON 
Mo OH 


IGM 
(mg/U 


9 RUN 


WET 
WEIGHT 
GLYCEROL 


(Q/U 
MeOH 


789 


G+IMB204S14 


4 


2.8* 


420 


107 


584 


60 


290°/90 - 


810 


G+IMB204S14 


4 


2.8- 


420 


181 


595 


51 


305775 - 


843 


6+1MB204S14 


4 


rs* 


420 


177 


482 


72 


30871 03 4 


906 


G+IMB204S14 


4 • 


2.8* 


420 


178 


471 


90 


300793 4 


929 


G+IMB204S14 


4 


2-8* 


420 


180 


400-480 


159* 


65-70* 


959 


M+IMB206S1 


6 


6.0 


420 


217 


383 


75 


54* 


960 


M + IMB206S1 


6 


2-8* 


420 


184 


387 


72 


138* 


962 


G+IMB204S14 


4 


2*8- 


420 


T88 


536 


72 


77- 


999 


M+IMB206S1 


6 


3.0* 


420 


191 


398 


72 


131 d 



The pH was deoreased from S to 2.8 at the initiation of the glycerol fed-botch phase. 
Continuous culture. 

Determined by Nichols Rf A of cod-free broth (see Example 5A). 

Authentic IGM levels determined by HFLC analysis of pretreated cell-free broth (see Example 5A>. 
The pH wa* decreased from 5 to the Indicated pH at the initiation of the methanol fed-batch phase. 



As shown in Tables I- V, the four different assays 
used to measure IGF-1 levels yielded inconsistent 
results. As discussed in Example 5, the different assays 
most likely detect different combinations of the several 
5 species of IGF-1 present in varying concentrations in P. 
pastoris broth / including: intact correctly folded 
monomeric IGF-1, referred to as authentic IGF-1; 
multimeric IGF-1: nicked IGF-1; and incorrectly folded 
IGF-1; thereby providing differing values for IGF-1 
10 concentrations of the broth. It is possible, however, to 
compare the IGF-1 levels of different fermentations when 
they are measured by the same assay. Such analyses yield 
the following conclusions. 
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The results of quantitative immunoblot analyses of 
fermentations of the six-copy strain G+IGF206S2 (Runs 
576, 578, 596, 598 and 613) and the one-copy strain 
G+IGF201S1 (Runs 573 and 605) are shown in Table I* All 
5 samples quantified by immunoblot were first reduced with 
dithiothreitol (DTT) , which disrupts disulfide bonds 
holding together multimeric or nicked IGF-1. Therefore, 
because only the protein that co-migrated with IGF-1 
standard was quantified in this assay (see Example 5A) , 

10 this method detected, but did not distinguish authentic 
monomer IGF-1, misfolded monomer IGF-1 and multimer IGF- 
1. The results shown in Table I demonstrate that 
expression cassette copy number has an effect on the 
level of immunoreactive IGF-1 contained in the broth, 

15 IGF-1 levels increase as the number of copies of the 
expression cassette contained in the strain increases 
(compare run 605 of the one-copy strain (21 mg/1) and Run 
613 of the six-copy strain (489 mg/1)). A comparison of 
the IGF-1 levels of broth from fermentations conducted 

20 entirely at pH 5 (Runs 573 and 576) and the levels from 
fermentations conducted at lower pH (pH decreased from 5 
to 3-3.5 at either initiation of glycerol feed (Runs 598, 
605 and 613) or during methanol fed batch phase (Run 
578) ) reveals that the greatest amount of immunoreactive 

25 IGF-1 was obtained when the fermentation was conducted at 
lower pH. Furthermore, the time at which the pH was 
lowered from 5 to £3.5 also had an effect on IGF-1 
levels. Lower IGF-1 levels were obtained when the pH of 
the fermentation was decreased later in the fermentation, 

30 i.e., during the methanol fed batch phase ( compear e Runs 
598 and 578). The results of several fermentations of 
the six-copy strain conducted according to the same 
protocol (addition of 150 g of glycerol, 60-70 hours of 
methanol feeding, and decrease in pH from 5 t 3 at the 

35 initiati n f glycerol feeding) dem nstrat d the 
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reproducibility of the IGF-1 production process, since 
all fermentations typically yielded between approximately 
350-480 mg/1) - 

The results of Incstar antisera-based RXAs (Table 
5 II; see Example 5A for a description of this assay) of 
broth from fermentations of the six-copy strain 
G+IMB206S1 (Run 661) , the four-copy strain G+IMB204S14 
(Run 691) , the two-copy strain G+IHB202S2 (Run 680) and 
the one-copy strain G+IGF201S1 (Run 605) further 

10 illustrate the dramatic effect of expression cassette 
copy number on IGF-1 expression by £^ pastor is . The 
four-and six-copy strains produced similar levels of RIA- 
reactive IGF-1 (1400 and 1280 mg/1, respectively) and 2- 
and nearly 10-fold more IGF-1 than the two- and one-copy 

15 strains (740 and 167 mg/1, respectively) • Furthermore, 
the Mut* six-copy strain G+IMB206S1 appeared to be 
superior to the Mut" six-copy strain (G-IMB206S3) in terms 
of IGF-1 production capability (compare Runs 661 and 
657) . Incstar antisera-based RIAs of broth from Runs 661 

20 and 694 (strain G+IMB206S1) were in agreement with the 

finding that pH 5 fermentations yield significantly lower 
levels of IGF-1 than low pH fermentations. 

Although Nichols RIAs consistently yielded lower 
values for IGF-1 expression levels than the Incstar 

25 antisera-based RIA, the relative IGF-1 levels from 

different fermentations, as measured by Nichols RIA, were 
in agreement with those as measured by Incstar antisera- 
based RIAs. For instance, a comparison of the IGF-1 
levels of fermentations of the two-, four-, and six-copy 

30 strains G+IHB202S2 (Run 680) , G+IMB204S14 (Run 691) and 
G+IMB206S1 (Run 661), respectively, as determined by 
Nichols RIA, further demonstrated the effect of 
expression cassette copy number on IGF-1 expression 
levels. Furthermore, the level of IGF-1 product 

35 determined by Nichols RIA was signif icantly higher in 
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fermentations conducted at lover pH: compare Run 694 (pH 
5; 15 mg/1) and Run 661 (pH 3; 140 xng/1) and Run 776 (pH 
2.8; 355 mg/1). Finally/ it appears that changing the 
amount of glycerol added during the glycerol batch and 
5 fed batch phases had little effect on the final IGF-1 

concentration from fermentations of the f our-and six-copy 
strains (compare Runs 772 (85 g glycerol added) and 776 
(45 g glycerol added) which both produced approximately 
345 mg/1; also compare Runs 756, .756, 757, 773 and 775 
10 (glycerol added carried from 40-175 g) which yielded 250- 
300 mg/1) . 

HPLC analysis (see Example 5C) of fermentation broth 
permitted distinction and accurate measurement of each 
species of IGF-1 present in the broth. Table XV lists 

15 the levels of authentic IGF-1, as- determined by reverse- 
phase HPLC of broth from one-liter fermentations of IGF- 
1-expressing strains of P. pastor is . The results of 
these HPLC analyses aire consistent with those of RIA and 
immunoblot analyses of broth. in terms of the relative 

20 IGF-1 levels in broth from different strains grown under 
different conditions. As shown in Table IV, expression 
cassette copy number clearly influences IGF-1 expression 
levels (compare IGF-1 levels of broth from a one-copy 
strain, G+IGF20161 (14 mg/1, Run 605) , a two-copy strain, 

25 G+IMB202S2 (39 mg/1, Run 680), a four-copy strain, 

G+IMB204S14 (101 mg/1, Run 691) and a six-copy strain, 
G+IMB206S1 (121 mg/1, Run 772)) In addition, the results 
in Table IV demonstrate the effect of pH on production of 
IGF-1 in £- pastor is : fermentations of strains G+IMB206S1 

30 and G+IMB204S14 conducted at pH 5 (Runs 694 and 943, 

respectively) yielded essentially no authentic IGF-l (or 
any form of IGF-1) ; whereas 121 and 74 mg of IGF-1/ 1 were 
produced in low pH fermentations of these strains (Runs 
772 and 757, respectively, pH was decreased from 5 to 2.8 

35 at the initiati n of the glycerol fed batch phase) . Th 
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results of HPLC analysis of broth from several different 
low pH fermentations of strain G+IMB204S14 in which the 
amount of glycerol added to the fermentor varied between 
40 and 175 g confirmed the finding that differing 
5 glycerol levels in this range had no effect on IGF-1 
production; all fermentations yielded 70-110 mg/1. 

The broth from one-liter fermentations of the 
protease-deficient (ESE4) strain M+IMB206S1 was also 
analyzed by HPLC. As shown in Table IV, pH 5 and low pH 

10 fermentations (Runs 935 and 934 , respectively) of this 
strain yielded similar high levels (167 and 139 mg/1, 
respectively) of authentic IGF-1, Furthermore, the level 
of authentic IGF-1 produced in these fermentations of 
strain M+IMB206S1 was approximately 60% higher than that 

15 produced in pH 2.8 fermentations of the four-copy strain 
PEP4 G+IMB204S14 (Run 757; 74 mg/i) and approximately 15% 
higher than that produced in pH 2.8 fermentations of the 
six-copy PEP4 strain G+IMB206S1 (Run 772; 121 mg/1) . 

The results of HPLC analysis of broth from one-liter 

20 fermentations of strains containing one or more oeMF 
prepro-lys-arg-glu-ala-IGF-l expression cassettes are 
also shown in Table IV. These strains showed two 
different levels of IGF-1 expression. Strains G+IGF816S1 
and G+IGF816S2 yielded similar levels of authentic IGF-1 

25 (approximately 16 mg/1) in two identical fermentations 
(Runs 1039 and 1048). In contrast, strains G+IGF816S9 
and G+IGF816S11 produced three-fold more authentic IGF-1 
(approximately 45 mg/1) in Runs 1049 and 1050. 
Therefore, it appears that strains G+IGF816S9 and 

30 G+IGF816S11 fortuitously incorporated 2-3 copies of the 
ctMF prepro-lys-arg-glu-ala-IGF-1 expression cassette into 
their genomes; whereas strains G+IGF816S1 and G+IGF816S2 
apparently contain a single copy of the expression 
cassette. 

35 Broth from fermentations of protease-deficient 
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strains M+IGF816S1 ( pep4 ) , M+IGF816S4 f pep4 ) and 
C+IGF816S1 ( pep4 ETfezl) containing one or more otMF 
prepro-lys-arg-glu-ala-IGF-l expression cassettes (see 
Example 2C) also was analyzed by HPLC. Based on the 
5 authentic IGF-1 levels of the broth from low pH 

fermentations of strains M+IGF816S1 and C+IGF816S1 (Runs 
1047 and 1055, 16-20 mg/1) as compared to those of the 
broth from low pH fermentations of strain M+IGF816S4 (Run 
1053, 33 mg/1) , it appears that strains M+IGF816S1 and 

10 C+IGF816S1 contain one copy of the expression cassette; 
whereas M+IGF816S4 contains two or more expression 
cassettes. It is noteworthy that pH 5 fermentations of 
these strains (Runs 1066 and 1067) yielded at least as 
. much authentic IGF-1 as low pH fermentations of these 

15 strains. In contrast, pH 5 fermentations of a PEP4 

strain containing multiple copies of the oMF-lys-arg-glu- 
ala-IGF-1 expression cassette (Run 1069) yielded no 
authentic IGF-1. 

As shown in Table V, high-level production of IGF-1 

20 by P. pastor is was readily scaled up to the 10-liter 
fermentation level. A comparison of low pH 10-liter 
fermentations of the four-copy strain G+IMB204S14 
conducted with varying methanol feed rates (Runs 789, 
810, 843, and 906) revealed similar levels of authentic 

25 IGF-1 production. A significant level of authentic IGF-1 
(54 mg/1) was produced in a pH 5.0 10-liter fermentation 
of the pep4 strain H+IMB206S1 (Run 959) . This level was 
increased in 10-liter fermentations of this strain that 
were initiated at pH 5 and then conducted at pH 2.8 or 

30 3.0 during the entire methanol fed batch phase (Runs 960 
and 999, respectively). A similar fermentation of strain 
G+IMB204S14 (Run 962) yielded IGF-1 levels equivalent to 
those obtained in a fermentation in which the initial pH 
of 5 was deer ased to 2.8 upon initiation of the glycerol 

35 fed batch phase (Runs 789, 810, 843 and 906). Continuous 
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culture of strain G+IMB204S14 (Run 929) produced between 
55 and 70 mg/1 of authentic IGF-1 at a rate of 230 ml of 
culture media per hour. Therefore, the productivity of 
continuous culture fermentations was slightly higher than 
5 that of fed-batch fermentations. 

EXAMPLE 5 

MEASUREMENT OF IGF-1 CONCENTRATIONS IN THE CULTURE 
MEDIUM. 

A* RIA Assay. 

10 A 1:5000 final dilution of rabbit anti-human IGF-1 

ant i sera (Incstar anti-somatomedin C antibody, catalog 
#22275), 10,000-12,000 cpm of 12S I-IGF-1 (Incstar catalog 
#22303) , and various dilutions of recombinant human IGF-l 
standard (purchased from Imcera and quantified by amino 

15 acid analysis) or the unknown broth solution were 

incubated overnight at 4°C in a final volume of 0.5 ml in 
12 x 75 mm polystyrene tubes. At the end of the 
incubation, 100 /il of Pansorbin (working dilution of 
1:40) was added to the tubes and incubated for 15 minutes 

20 at room temperature. Two milliliters of RIA buffer (50 
mM NaP0 4 , 0.1% BSA, 0.1% NaN 3 , and 0.1% Triton X-100, pH 
7,4) were added to each tube before centrifugation at 
3200 rpm for 68 minutes at 4 °C in a Beckman J6M 
centrifuge. Following centrifugation, the supernatant 

25 was decanted and the radioactivity associated with the 
pelleted material was determined in a gamma counter. 

Alternatively, some of the samples were assayed 
using a commercial RIA (Nichols Institute Diagnostic; San 
Juan Capistrano, CA>. The Nichols assay consistently 

30 measured lower levels for IGF-1 than the RIA described 
above (see data above from Runs 661, 680, 691 and 694) . 
This apparent discrepancy probably arises because the 
antibody used in the RIA described above detects both 
monomer and multimer IGF-1, whereas the antibody used in 

35 the Nichols assay most likely detects monomer IGF-1 only. 
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B. Western blot 

Immunoblot analysis of £^ pastoris fermentation broth was 
used to qualitatively and quantitatively evaluate IGF-1 
production in £^ pastoris . 
5 Several dilutions of recombinant human IGF-1 

standard (Imcera) and samples of cell-free broth obtained 
at the conclusion of selected fermentations of the Mut* 
and Mut* IGF-l-expressing strains of Pichia pastoris were 
analyzed by quantitative western blots to estimate the 

10 amount of immuno-reactive IGF-1 contained in the broth. 
The proteins were first separated by Tricine SDS-PAGE 
(Schegger, H. and von Jagow, G. (1987) Anfrli pj-ochera, 
117 : 304-310) and then transferred to 0.1 ^m 
nitrocellulose by electroblot in a solution of Towbin 

15 buffer (25 mM Tris-HCl, pH 8.3, 190 mM glycine, 20% 

methanol) for at least 90 minutes at 20 V/cm. After the 
proteins were transferred onto nitrocellulose, the filter 
was incubated for one hour in blocking buffer (0.25% 
gelatin, phosphate-buffered saline, 0.05% Tween 20, 0.02% 

20 sodium azide) . Rabbit anti-IGF-1 antisera 10A was 

diluted 1:2000 with blocking buffer and incubated with 
the filter for a minimum of two hours. Antibody 10A was 
raised against a synthetic peptide corresponding to the 
last 14 amino acids of the car boxy terminus of human IGF- 

25 1, which was conjugated to human a-globulin. The 

antisera was adsorbed to cr-globulin prior to use. The 
filter was washed with blocking buffer for an hour and 
incubated with ie5 I-Protein A (0.02 fiCi/ml) for 45 
minutes. After one hour of washing with blocking buffer, 

30 the filter was air dried and exposed to X-ray film with 
an intensifying screen at -75°C. To estimate the IGF-1 
content of broth, the intensities of bands corresponding 
to IGF-1 in lanes containing broth were compared to the 
intensities of bands corresponding to different known 

35 amounts of IGF-1 standard. Autoradiographs were used as 
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templates and the portions of the gels corresponding to 

bands were excised and the radioactivity in the bands 

counted in a scintillation counter. A standard curve was 

prepared by plotting the cpm of bands containing known 

5 amounts of IGF-1 versus IGF-1 concentration was prepared 

and the IGF-1 content of the broth was estimated by 

comparison with the standard curve. 

c. Quantitative Reverse-Phase High-Performance 
Liquid Chromatography (HPLC) 

10 1. HPLC System 

A Waters (Bedford, MA) 600 solvent delivery system, 
Waters Model 481 Lambda Max variable wavelength detector, 
Wisp 71 OB auto-injector and a Schimadzu Crom-Pac 
integrator (Cole Scientific, Moorepark, CA) constituted 

15 the HPLC system. A Vydac C4 column (0.46 X 5 cm) with a 
guard column was used to resolve components of the Pichia 
pastorls -produced IGF-1 preparations. Broth samples that 
had been pretreated as described in Example 6.C.2. below 
were loaded onto the column at a flow rate of 1 ml/minute 

20 and were eluted in a trif luoroacetic acid 

(TFA) /acetonitrile-TFA gradient. The eluaiit was prepared 
by using mobile phase A (0.1$ TFA) to dilute mobile phase 
B (95% acetonitrile, 5% water, 0.1% TFA). A 1%/minute 
gradient of 25% to 42% mobile phase B was passed through 

25 the column during a period of 17 minutes at a flow rate 
of 1 ml /minute to elute the material that had been loaded 
onto the column. The column was then regenerated with 
100% mobile phase B at a flow rate of 2 ml/minute for 4 
minutes followed by 25% mobile phase B for 4 minutes at 2 

30 ml/minute. The flow rate was then reduced to 1 ml/minute 
and the column was equilibrated at the initial starting 
conditions (25% mobile phase B) for 2 minutes before 
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reinjection with another sample to be analyzed. The 
detector was set at 0.05 absorbance units full scale 
(AUPS) and a wavelength of 215 run was used for maximum 
sensitivity • 

5 2. Pre treatment of crude fermentation broth. 

p. Pastoris -produced IGF-1 exists as several forms 
in the broth of fermentations of IGF-l-expressing 
nastoris strains, HPLC analysis of crude cell-free broth 
from fermentations of IGF-l-expressing Rj_ pastoris does 

10 not adequately resolve the various IGF-1 species. In 
order to distinguish these IGF-1 species by HPLC, the 
authentic P^. pastoris proteins must be removed from the 
broth by a small-scale cation exchange chromatography 
step. Several cation exchange systems were tested for 

15 this purpose: sulfylpropyl cation exchange capsules (FMC 
(Pinebrook, NJ) and Cuno (Meriden, CT)) and the use of a 
bulk cation exchanger (e.g. , S-Sepharose Fast Flow 
(Pharmacia, Uppsala, Sweden), SP-Spherodex (IBF, 
Columbia, MD) , or Toyopearl SP650M and SP550C (Toso 

20 Haas, Woburn, MA) ) in a 2-ml disposable polypropylene 
column (0.8 x 4 cm, BioRad) with an integrated 10-ml 
reservoir. Any of these systems yielded satisfactory 
results. The two systems routinely used to pretreat crude 
broth for quantitative HPLC analysis of the IGF-1 levels 

25 employed the Cuno cation exchange capsule or the SP- 
Spherodex or SP550C cation exchangers in a column format. 
The crude broth was cleaned up by cation exchange 
chromatography and injected directly onto an HPLC column. 
The resulting chromatogram clearly resolved the different 

30 IGF-1 species in broth. 

a. Pretreatment using cation exchange 

capsules . 

The Cuno cation exchange capsule is a 2 5 -mm disk. 
It was first washed with a 4 ml 0.2 M acetic acid, then 
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equilibrat d with 4 ml 0.02 M acetic acid. A volume of 
crude broth (1-10 ml) was diluted 1:2 with 0.02 M acetic 
acid. A volume of crude broth (1-10 ml) was diluted 1:2 
with 0.02 M acetic acid and loaded onto the disk. After 
5 loading, the disk was washed with 1.5 ml 0.02 M acetic 
acid and the IGF-1 was eluted with 4 ml 0.02 M sodium 
acetate, Ph 5.5, plus 1 M NaCl. The first 1.5 ml of 
eluate contained 75-80% of the total IGF-1 and was 
usually the only elution volume collected. The capsule 
10 could then be regenerated by washing with 4 ml of 100% 
methanol. 

b. Pre-treatment using bulk cation 
exchanger in a column format. . 

To the disposable column, 0.25 ml of 

15 prehydrated cation exchanger was added. The absorbent 
was first washed with 2 ml of 0.2 M acetic acid, then 
equilibrated with 2 ml of 0.02 M acetic acid. A volume 
of broth (1 ml) was loaded onto the column which was then 
washed with 1 ml of 0.02 M acetic acid. All buffers and 

20 broth were carefully added to the column in an effort to 
avoid disruption of the column bed. Although some 
suspension of the absorbent usually occurred during 
addition of liquid to the column, it was not detrimental 
to sample binding or eluting. Broth samples and buffer 

25 were allowed to flow through the column by gravity. The 
IGF-1 was eluted with 2 ml of a 0.05% M sodium acetate, 
pH 5.5, buffer containing l M NaCl. The first milliliter 
of eluate contained 80-90% of the total IGF-1, eluted by 
the 2 ml of elution buffer. Periodically (approximately 

30 every 5-10 samples) , the column was regenerated after the 
salt elution with a 50% methanol wash. Less often, the 
column was also regenerated with 0.5 M NaOH. These 
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columns retain their selective binding properties through 
many successive uses. 

3. Measurements of IGF-1 concentrations. 
In order to measure the concentrations of Pichia - 
5 produced IGF-1 by HPLC, known amounts of standard IGF-1 
(Amgen, Thousand Oaks, CA) and authentic Pichia -produced 
IGF-1, which was purified as described in U.S. Patent 
Application Serial No. 07/641,430) and quantified by 
amino acid composition analysis were injected onto the 

10 HPLC column and the area under the corresponding peaks in 
the chromatogram was measured. A standard curve was 
generated by plotting area versus micrograms of IGF-1 
loaded onto the HPLC column. A conversion factor for use 
in converting the area under HPLC chromatogram peaks to 

15 IGF-1 concentration was calculated from the standard 
curve. Using this information, it was possible to 
determine the concentration of authentic IGF-1 in a 
pretreated broth sample by measuring the area under the 
corresponding peak on the chromatogram from HPLC analysis 

20 of the sample. This conversion factor was also used to 
estimate the approximate concentration of other IGF-1 
species as well. The absolute concentrations of each of 
these other species, however, may vary depending on 
differences in their specific conversion factors. 

25 EXAMPLE 6 

CHARACTERIZATION OF THE IGF-1 SECRETED INTO THE CULTURE 
MEDIUM 

Pichia -produced IGF-1 was characterized by 
immunoblot, SDS-PAGE, HPLC, and protein sequence 
30 analyses. These methods and the results of these 
analyses are described in this example. 
A. Immunoblot Analysis 

Several characteristics of the IGF-1 produced in 
fermentations of IGF-l-expressing £^ pastoris were 
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ascertained from immunobl t analysis of reduced (by 
adding dithiothreitol (DTT) ) and non-reduced (no added 
DTT) samples of broth (performed as described in Example 
SB) - 

5 First, if the broth samples were not reduced prior 

to electrophoresis, the IGF-1 migrated as several forms 
which appeared to be a monomer, dimer and various 
multimers of IGF-1. The ratio or profile of monomer to 
multimer forms of secreted IGF-1 did not seem to be 

10 affected by copy number. Second, the immunoblot of 
reduced samples revealed the absence of most of the 
higher molecular weight immunoreactive species that were 
evident in the nonreduced broth samples, and revealed the 
presence of a protein that co-migrates with IGF-1 

15 standard (monomer) as well as immunoreactive protein 
which migrates to a position slightly below that of 
standard IGF-1. N-terminal protein sequence analysis 
(Example 6.d.) of this lower molecular weight 
immunoreactive species demonstrated that it begins with 

20 residue 25 of mature human IGF-1 and, thus, is a 
proteolytic fragment of IGF-1. Since this lower 
molecular weight species is not seen under non-reducing 
conditions, the IGF-1 molecule may be nicked only between 
residues 24 and 25, i.e. , not completely fragmented, with 

25 the disulfide bonds at Cys-6/Cys-48 and Cys-18/Cys-61 of 
IGF-1 holding the peptide fragments that contain amino 
acids 1-24 and 25-70 together. Further, the decreased 
glycerol fermentation (Run 755) produced wholly intact 
IGF-1 molecules (no degradation) , while the extended 

30 glycerol feed fermentation (Run 756) demonstrated a low 
level of degraded product. The degradation appeared to 
be slightly higher with the six-copy strain (Run 776) 
than with the f our-copy strain (Run 775) . 
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IGF-1 produced in the fermentation of the six-copy 
Mut" strain (Run 657) was also analyzed by SDS-PAGE and 
Western blot. The IGF-1 band patterns of reduced and 
non-reduced broth from the fermentation of the six-copy 
5 Mut" strain are similar to the band patterns of reduced 
and non-reduced broth from the fermentations of the six- 
copy and four-copy Mut + strains. 

B. Qualitative HPLC Analysis 

Crude pastoris fermentation broth was also 

10 analyzed by HPLC (see Example 5.C.1 for a description of 
the HPLC system used in this analysis) . Direct injection 
of crude broth onto the HPLC column did not result in a 
chromatogram with distinct peaks. In order to analyze 
the components of crude broth by HPLC, it was necessary 

15 to remove endogenous £^ pastoris contaminants from the 
broth using small-scale cation exchange chromatography, 
as described in Example 5.C. 

The eluate obtained after small-scale cation 
exchange chromatography of cell-free broth from a low pH 

20 fermentation of an IGF-l-expressing strain of £^ pastoris 
contains all the IGF-1 polymers, including nicked, 
misfolded, multimeric and authentic IGF-1 which can be 
resolved by the HPLC protocol described in Example 5C. 
The different peaks detected in the chromatogram from 

25 HPLC analysis of broth that had been subjected to cation 
exchange chromatography correspond to different forms of 
IGF-1 produced in recombinant Ei. pastoris fermentations. 
The identity of the proteins corresponding to these peaks 
was established in HPLC, SDS-PAGE, immunoblots, gel 

30 filtration, cation exchange and hydrophobic interaction 
chromatography (performed as described in US Patent 
Application Serial No. 641,430). 

A chromatogram from HPLC analysis (conducted as 
described in Example SC.) of broth from a fermentation of 

35 an IGF-l-expressing £j_ pastoris strain contains a peak 
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corresponding to protein that eluted from the HPLC column 
at approximately 10 minutes, which represents correctly 
folded, intact IGF-1 monomer. The identity of this 
protein was confirmed initially on the basis of its 
5 elution time by HPLC, which is identical to that of 
standard recombinant IGF-1 (Amgen, Thousand Oaks, CA) . 
Furthermore, the protein with an elution time of 
approximately 10 minutes was purified (see U.S. Patent 
Application Serial No, 641,430) and subjected to 

10 additional analyses. SDS-PAGE analysis of reduced and 
non-reduced samples of the purified protein yielded 
identical results, revealing that it was a 7.7-kDa intact, 
protein that co-migrated with IGF-1 standard. Immunoblot 
analysis of the purified protein demonstrated that it is 

15 reactive with an antibody generated against the last 14 
amino acids of IGF-1. Gel filtration chromatography of 
the purified protein revealed that it elutes as expected 
for an IGF-1 monomer of the correct size. Finally, amino 
acid analysis confirmed that the amino acid ratios of the 

20 purified protein correspond to those of standard IF-1. 
Protein sequence analysis showed that the complete amino 
acid sequence of the purified protein is identical to 
that of authentic IGF-1. 

The protein that elutes from the HPLC column at 

25 approximately 8.6 minutes was tentatively identified as 
misfolded IGF-1. This protein was isolated by HPLC and 
hydrophobic interaction chromatography and characterized 
by SDS-PAGE, immunoblot and protein sequence analysis. 
SDS-PAGE analysis of reduced and non-reduced samples of 

30 this protein demonstrated that this form migrated with 
authentic monomeric IGF-1 and that it was not a nicked 
form of IGF-1. Immunoblot analysis of this protein using 
an antibody directed against the C-terminus of IGF-l 
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shoved that it was immunoreactive. Amino- terminal 
protein sequencing of this protein also confirmed that 
the molecule was intact since only one amino- terminal 
sequence could be identified. These results suggest that 
5 this form is a misfolded species of IGF-1. 

The proteins that elute from an HPLC column at 10.5- 
11.5 minutes were identified as nicked or degraded forms 
of IGF-1 (i.e., IGF-1 molecules containing two or more 
peptide fragments, generated by cleavage of one or more 

10 peptide bonds and held together by disulfide bonds) . 

There appears to be at least two peaks by HPLC analysis 
of cleaned up broth that correspond to the nicked IGF-1. 
The protein represented by the major peak (eluting at 
10.7 minutes) was isolated during the S-Sepharose cation 

15 exchange step of the IGF-1 purification process (see U.S. 
Patent Application Serial No. 641,430). SDS-PAGE 
analysis of non-reduced samples of this isolated species 
revealed that it co-migrated with IGF-1 standard and 
appeared as a single band. Gels of reduced samples of 

20 this protein, however, exhibited a doublet representing 
two peptides of approximately 3-4 kDa each (approximately 
half the size of intact IGF-1) . The position of this 
doublet in the gel corresponded to that of the lower of 
the two bands detected below the band representing intact 

25 IGF-1 in gels of reduced samples of crude broth. These 
results indicate that this molecule is monomeric IGF-1 
cleaved or nicked and held together by disulfide bonds. 
Amino-terminal protein sequence analysis of the protein 
confirmed that the molecule is nicked prior to residue 40 

30 of IGF-1. Immunoblot analysis of reduced and non-reduced 
samples of this isolated nicked IGF-1 molecule revealed 
that it is less reactive than intact IGF-1 with the 
antibody directed against the Oterminus of IGF-1. 

Two additional nicked species were identified in 

35 protein sequence analysis of IGF-1 recovered from the 
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f irst cation exchange chromatography step of the 
purification process (see U.S. Patent Application Serial 
No. 641,430). Either or both of these species could 
correspond to the minor peak in the HPLC chromatogram of 
5 cleaned up cell-free broth (protein eluting at 11 

minutes) . The amino-terminus of the C-tierminal fragment 
of one of these nicked molecules begins at residue 25 of 
IGF-1. The amino terminus of the Oterminal fragment of 
the other nicked species detected in the broth begins at 

10 residue 14 of IGF-l. 

The last set of proteins detected in HPLC analysis 
of cell-free broth, which elute from the HPLC column 
after 11.5-16 minutes, appears to be disulf ide-bonded 
multimer forms of IGF-1. The presence of disulf ide- 

15 bonded IGF-1 multimers in £^ pastoris broth was indicated 
in SDS-PAGE gels of broth and immunoblots of the gels. 
The putative multimers migrated as IGF-1 diners and 
trimers on non-reduced SDS-PAGE gels and were reactive 
with antibody directed against the C-terminus of IGF-1. 

20 When these multimers were reduced, they co-migrated with 
standard monomer IGF-1 in SDS-PAGE gels, which indicates 
that contain disulf ide-bonded IGF-1 monomers. The amino 
acid sequence generated in protein sequence analysis of 
gel-isolated dimer was identical to the N-terminus of 

25 authentic IGF-1. Furthermore, multimer IGF-1 (apparent 
dimer and trimer species) were isolated on a gel 
filtration column and analyzed by HPLC. The isolated 
multimer eluted from the column at 12-14 minutes, which 
corresponds to the elution times of the proteins in broth 

30 that were proposed to be multimers of IGF-1. 

EXAMPLE 7 

CHARACTERIZATION OF PURIFIED AUTHENTIC PICHIA-PRODUCED 
IGF-1. 

Correctly folded, intact, monomer ic IGF-1 (authentic 
35 IGF-1) was purified from the broth of IGF-l-expressing 
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strains G+IMB204S14 and M+IMB206S1 using a combination of 
cation exchange chromatography, hydrophobic interaction 
chromatography and gel filtration chromatography methods. 
The purification procedure is described in detail in U.S. 
5 Patent Application Serial No. 07/641,430. The purified 
material was characterized by HPLC, gel filtration 
chromatography, SDS-PAGE and immunoblot analysis, and 
amino acid, composition and sequence analysis. 

A. HPLC Analysis 

10 Several different dilutions of the purified 

preparation were subjected to HPLC using the system and 
procedure described in Example 5C. A single major peak 
was detected in the resulting chromatogram which 
represented a protein with an elution time identical to 

15 that of IGF-1 standard. 

B. Gel Filtration Chromatography 

1. Procedure 

A Superdex 75 HR 10/30 gel filtration column 
(Pharmacia) with a bed volume of 24 ml (10 x 300 mm) was 

20 used in gel filtration chromatography methods of 

characterizing IGF-1 on the basis of size. The HPLC 
system described in Example 5C was modified for use with 
this column. The detector was set at a wavelength of 280 
nm and a sensitivity of 0.02 AUFS. The gel filtration 

25 chromatography column buffer contained 0.05 M ammonium 
acetate, pH 6. Buffer was flowed through the column at a 
flow rate of 0.5 ml/minute. 

2. Results 

The chromatogram from get filtration chromatography 
30 analysis of 200 /ig of purified IGF-1 yielded a single 
peak representing protein that co-eluted with IGF-1 
standard. 

C. SDS-PAGE and immunoblot Analysis. 

The purified material was examined by SDS-PAGE and 
35 immunoblot methods as d scribed in Example SB. Only a 
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single band was detected in both silver-stained gels and 
immunoblots of reduced and non-reduced samples of the 
material containing 1 tig of protein or less. This 
protein co-migrated with IGF-1 standard. 
5 D. Amino Acid Composition 

Purified IGF-l was acid hydrolyzed, and the amino 
acids were characterized on a Beckman (Palo Alto, CA) 
6300 Amino Acid Analyzer. To acid hydrolyze the IGF-1 
protein, carefully measured volumes of purified IGF-1 

10 solution were added to 6 x 50 mm glass tubes and dried in 
a Savant (Farmingdale, NY) Speed Vac. These tubes were 
placed in a reaction flask containing "0.5 ml 6 N HC1. 
Oxidation was minimized by applying a vacuum and sealing 
the flask. The flask was placed overnight in a 110°C 

15 oven, and the protein was hydro lyzed by the HC1 vapors. 

Following hydrolysis, the reaction flask was cooled 
to room temperature, and the hydro lyzate was removed. 
Any HC1 that may have condensed in the tubes was removed 
by drying the tubes in a Speed Vac. The free amino acids 

20 were dissolved in a minimum of 100 /il Beckman Amino Acid 
Sample Dilution Buffer, Na-S, for loading in the 50-/il 
loop of the analyzer. A Nelson (Cupertino, CA) 3000 
Series Chromatography Data System was used to measure 
concentrations by comparing the integrated chromatogram 

25 of amino acid standard solutions and of the resuspended, 
hydrolyzed sample. 

Table VI shows the amino acid ratios estimated for 
the purified IGF-1 and the actual published amino acid 
ratios for human IGF-1. The estimated and published 

30 ratios are in close agreement, and slight deviations in 
the estimated and published compositions are within the 
expected limits of this analysis. 
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TABLE III 



Purified IGF-l 
Amino Acid Composition Analysis Data 

5 Published 

Amino Acid Composition 6 Experimental 



Asp (+Asn) 



5 5*8 



Thr 3 3 • 1 

10 Ser 5 4.9 

Glu(+Gln) 6 6.9 

Pro 5 3 • 0 



Gly 



7 7.4 



Ala 6 6.8 

15 cys 6 5.5 

Val 3 2 * 6 

Met 1 °- 9 

He 1 °- 66 

Leu 6 7 * 1 

20 Tyr 3 2wl 

Phe 4 3 • 6 

His -0 0 

3 3 * 3 



Lys 
Arg 



6 6.7 



25 Trp 0 



"Derived from nucleotide sequence published by Rotwein et al^. (1986) 
J, Biol. Chem. 261 :4828 . 

30 E. Amino Acid Sequence 

l. Procedure 
To determine the N-terminal amino acid sequence of 
the IGF-l purified from the broth of strain M+IMB206S1, 
samples of this material were loaded directly onto an 

35 Applied Biosystems (Foster City, CA_ 470/120 Gas Phase 

Protein Sequencer. Sequencing was performed according to 
the methods described by Hunkapiller and Hood (Science 
215! 650 (1983)). 

To determine the entire amino acid sequence of IGF-l 

40 purified from the broth of strain G+IMB204S14, the 

material was sequenced from the N-terminal amino acid 
through as much of the remainder of the protein sequence 
as possible. This analysis generated the sequence of the 
first 59 residu s f the purified prot in. Because the 
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amino acid at residue 59 is a methionine residue, and 
cyanogen bromide cleaves proteins after methionine 
residues, it was possible to isolate the peptide 
containing the C-terminal 11 amino acids (residues 60-70) 
5 of purified IGF-1 for use in completing the protein 
sequence analysis of the purified material. The c- 
terminal 11 amino acids of the purified IGF-1 were 
obtained as a peptide fragment isolated from cyanogen 
bromide-treated IGF-1 by HPLC, using the same C4 column 

10 as described in Example 5C. This fragment was loaded 
onto the protein sequencer to generate the sequence of 
the C-terminal amino acids (amino acids 60-70) - 

The N-terminal amino acid sequences of authentic 
IGF-1 from the broth of fermentations of strains 

15 G+IGF816S1 and G+IGF816S11 were also determined. The 
broth was pre-treated as described in Example 5C and 
subjected to HPLC (also as described in Example 5C) • The 
fraction that eluted from the HPLC column with an elution 
time identical to that of standard IGF-1 was applied to 

20 an SP-Spherodex column to remove the HPLC solvent 

(acetonitrile and TFA) . The eluate fraction containing 
the IGF-1 was then loaded directly onto the protein 
sequencer. 

2. Results 

25 The first 30 amino acids of the IGF-1 purified from 

the broth of strain M+IMB206S1 and the entire amino acid 
sequence of IGF-1 purified from broth of strain 
G+IMB204S14 were identical to the corresponding amino 
acids of human IGF-1, Sequence ID No. 1. 

30 Similarly, the first 15 amino acids of the IGF-1 

from the broth of strains G+IGF816S1 and G+IGF816S11 were 
identical to the corresponding amino acids of human IGF- 
1. These results confirm that mature IGF-1 is correctly 
processed from the aMF prepro-lys-arg-IGF-1 or otMF 

35 prepro-lys-arg-glu-ala-IGF-1 precursor in Ej. pastoris . 
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Since modificati ns will be apparent t those of 
skill in the art, it is intended that this invention be 
limited only by the scope of the appended claims. 
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SEQUEHCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Brierley, Russell A. 

Davis, Geneva R. 
Holtz, Gregory C. 
Glees on, Martin A. 
Howard, Bradley 0. 

(it) TITLE OF INVENTION: Production of Insulin-Like Growth 
Factor- 1 in Hethylotrophie Yeast Cells 

(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fitch. Even, Tabin & F tannery 
(8) STREET: 135 South LaSalle Street, Suite 900 
(C) CITY: Chicago 
<D> STATE: Illinois 
(E> COUNTRY: USA 
(F) ZIP: 60603 

(v) COMPUTER READABLE FORK: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-OOS/HS-DOS 

(0) SOFTWARE: Patent In Release #1.0, Version #1.25 

(vl) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
<B> FILING DATE: 
(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/578,728 
<B> FILING DATE: 04 -SEP -1990 

Cviii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Seidaan, Stephanie L. 

(B) REGISTRATION NUMBER: 33,779 

(C) REFERENCE/DOCKET NUMBER; 51874 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619)552-1311 

(B) TELEFAX: (619)552-0095 



(2) INFORMATION FOR SEQ ID 110:1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAM) EON ESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cONA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 14.. 232 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

AAGCTTACCT GCC ATG GGA CCG GAG ACG CTC TGC GGG GCT GAG CTC GTG 49 
Met Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val 
1 5 10 

GAT GCT CTG CAG nC GTG TGT GGA GAC AGG 6GC TTT TAT TTC AAC AAG 97 
Asp Ala Leu Gin Phe Val Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys 
15 20 25 
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CCC ACA GGG TAT GGC TCC AGC ACT CGA CGG GCG CCT CAG ACA GGC ATC 145 
Pro Thr 6ly Tyr Gly Ser Sep Ser Arg Arg Ala Pro Gtn Thr Gly He 
30 35 40 

GTG GAT GAG TGC TGC TTC CGG AGC TGT 6AT CTA AGG AGG CTC GAG ATG 193 
Val Asp Glu Cys Cys Phe Arg Ser Cys Asp Leu Arg Arg Leu Olu Net 
45 50 55 60 

TAT TGC GCA CCC CTC AAG CCT GCC AAG TCA 6CT TGA TAAGGATCCG 239 
Tyr Cys Alt Pro Leu Lys Pro Ala Lys Ser Ala * 
65 70 

A 240 



(2) INFORMATION FOR SCO 10 MO: 2: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DMA (genomic DMA) 



(xl) SEQUENCE DESCRIPTION: SEO ID NO:2: 

GTATCTTTGG ATAAAAGAGG ACCGGAGACG CTCTGC 36 

(2) INFORMATION FOR SEQ ID NO: 3: 

Ci) SEQUENCE CHARACTERISTICS; 
CA) LENGTH: 16 base pairs 
(8) TYPE: nucleic acid 
CO STRAWBWESS: single 
CO) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DMA (genomic DMA) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
ATAAAAGAGG ACCGGA 16 
(2) INFORMATION FOR SEQ 10 N0:4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic DMA) 



(xi) SEQUENCE DESCRIPTION: SEQ 10 KO:4: 

AGTCAGCTTG ATAAGAATTC AAATGAGTCG ACCTGCAGGC 40 

(2) INFORMATION FOR SEQ ID 1*0:5: 

CD SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 17 base pairs 
(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: unknown 

(11) MOLECULE TYPE: DNA (genomic DNA) 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

TAAGAATTCA AATGACT . 

(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 btse pairs 
<B) TYPE: nucleic acid 

(C) STRAND ED NESS: single 

(D) TOPOLOGY: mknown 

(if) MOLECULE TYPE: DMA ( genomic DMA) 



(Xf) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TCTTTGGATA AAGAGAGGCT GGACCGCAGA CGCTC 
(2) INFORMATION FOR SEQ ID N0:7: 

<{} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
<C) STRAKOEDHESS: single 
(D) TOPOLOGY: unknown 

(U) MOLECULE TYPE: DMA (genomic DMA) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
AAAAGAGAGG CTGGACCG 
(2) INFORMATION FOR SEQ ID N0:8: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED HESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DMA (genomic DMA) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8; 
AATTCGATGA 6ATTTCCTTC AATTTTTACT GCA 
(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: ONA (genomic DMA) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID K0:9: 
GTAAAAATTG AAGGAAATCT CATCG 
(2) INFORMATION FOR SEQ ID N0:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: unknown 
(11) MOLECULE TYPE: DMA (flenotnic DNA) 



(xl) SEQUENCE DESCRIPTION: SEQ ID KO:10: 
6ACGTTCCTT TGTGCGGATC CAATGCGGTA GTTTAT 
(2) INFORMATION FOR SEQ ID N0:11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS H base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: unknown 

(11) MOLECULE TYPE: DNA (genomic OKA) 



(xl) SEQUENCE DESCRIPTION: SEQ ID N0:11: 
CAGCAGATCT GCTG 

(2) INFORMATION FOR SEQ ID N0:12: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPEt nucleic acid 

(C) STRAND EO NESS: double 
(0) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic DNA) 



(Xl) SEQUENCE DESCRIPTION! SEQ 10 NO: 12: 
CAGCAGATCT GCTG 
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THAT WHICH IS CLAIMED IS: 

1. A DKA construct, comprising: 

(a) DNA encoding an insulin-like growth 
factor-1 (IGF-1); 

5 (b) a promoter region from a methanol 

responsive gene of a methylotrophic yeast operably linked 
to said DNA that encodes IGF-1; 

(c) DNA encoding a signal sequence of amino 
acids that effect extracellular secretion of IGF-1 from a 

10 methylotrophic yeast host cell, 

wherein the DNA that encodes said signal sequence is 
operably linked via DNA encoding one or more processing 
sites that are recognized and cleaved by methylotrophic 
yeast cell proteases to the DNA that encodes IGF-1 ♦ 

15 2, A methylotrophic yeast host cell, comprising the 

DNA construct of claim 1, wherein said DNA construct is 
into the genome of said host cell. 

3. A process for producing IGF-1, comprising 
culturing the yeast host cell of claim 2 under conditions 

20 whereby IGF-1 is expressed and secreted into 

extracellular medium at concentrations of 10 mg/ml or 
greater. 

4. A DNA construct, comprising an expression 
cassette that contains: 

25 (a) DNA encoding an insulin-like growth 

factor-1 (IGF-1) ; 

(b) a promoter region from a methanol 
responsive gene of a methylotrophic yeast operably linked 
to said DNA that encodes IGF-1; 

30 (c) DNA encoding the £*. cerevisiae a-mating 

factor (acMF) pre-pro sequence: 

(d) a transcription terminator functional in a 
methylotrophic yeast host cell operably linked to said 
DNA that encodes IGF-1, 

35 wherein the DNA that encodes said pre-pr sequence 
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is operably linked via DNA encoding one or more 
processing sites selected from the group consisting of 
lys-arg and lys-arg-(glu-ala) x to the DNA that encodes 
IGF-l; 

5 x is the number of repeats of glu-ala sequences and 

is less than the number of repeats that results in 
improperly cleaved IGF-l in the extracellular medium. 

5. The DNA construct of claim 4, wherein x is 1, 2 

or 3. 

10 6. A DNA construct of claim 4, further comprising 

at least one selectable marker gene and a bacterial 
origin of replication. 

7. A plasmid comprising the DNA construct of claim 

6. 

15 8. The plasmid of claim 7, selected from the group 

consisting of pIGF201, pIGF202, pIGF204, pIGF206 and 
PIGF816. 

9. The DNA construct of claim 4, wherein said the 
DNA encoding an IGF-l peptide has the sequence set forth 

20 in Figure 1. 

10. The DNA construct of claim 4, wherein said 
methylotrophic yeast from which the promoter is derived 
is a strain of Pichia uastoris. 

11. The DNA construct of claim 10, wherein said 
25 methanol-responsive gene of a methylotrophic yeast and 

the transcription terminator are both derived from the L. 
pastoris AOX1 gene. 

12. The DNA construct of claim 11 , wherein the 3'- 
and 5' -ends of the construct have sufficient homology 

30 with a target gene of a yeast host to effect site 

directed integration of said construct into said target 
gene. 

13. A DNA construct of claim 12, comprising 
multiple copies of said expression cassette. 

35 14. A DNA construct of claim 4, wherein the 3»- and 
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S'-ends of the construct have sufficient homology with a 
target gene of a yeast host to effect site directed 
integration of said construct into said target gene. 

15. A DNA construct of claim 4, comprising multiple 
5 copies of said expression cassette. 

16. The DMA construct of claim 15, wherein said 
multiple copies of said expression cassette are oriented 
in head-to-tail orientation. 

17. A DNA construct of claim 4, wherein said 

10 construct is included in the DNA in plasmids selected 
from the group consising of pIGF201, pIGF202. pIGF204, 
pIGF206 and pIGF816. 

18. A methylotrophic yeast cell, comprising the DNA 
construct of claim 4. 

15 19. A methylotrophic yeast cell, comprising the DNA 

construct of claim 9. 

20. The methylotrophic yeast cell of claim 18, 
wherein said yeast is a strain of Pichia pastoris . 

21. A. methylotrophic yeast cell, comprising the DNA 
20 construct of claim 15. 

22. A Pichia pastoris cell, comprising the DNA 
construct of claim 12. 

23. The £j_ pastoris cell of claim 22, wherein said 
cell is selected from the group consisting of strains 

25 G+IGF201S1, G4TGF816S1, G+IGF816S2, M+IGF816S1 and 
C+IGF816S1. 

24. AL pastoris cell, comprising the DNA 
construct of claim 13. 

25. The £i pastoris cell of claim 24, wherein said 
30 cell is selected from strains G+IGF206S2, G4GF816S9, 

G+IGF816S11, G+GF816S9 and G+IGF816S11 G+IMB202S2 
G+IMB204S14, G+IMB206S1, G+IMB206S2, G+IMB206S3, G- 
IMB206S1, G+IGF816S9, G+IGF816S11, M+IGF816S4 and 
M+IMB206S1. 

35 26. The E«_ pastoris cell of claim 24, wherein said 
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cell is selected from the group consisting of strains 
G+IGF206S2, G-IMB206S3, G+IMB206S1, G+IMB202S2, and 
G+IMB204S14. 

27. A culture of viable methylotrophic yeast cells, 
5 comprising yeast cells of claim 18. 

28. A culture of viable Ex. pastoris cells, 
comprising yeast cells of claim 22. 

29. A culture of viable E*. pastoris cells, 
comprising a single strain of yeast cells of claim 23. 

10 30. A culture of viable £j_ pastoris cells, 

comprising yeast cells of claim 24. 

31. A culture of viable S_u pastoris cells, 
comprising a single strain of yeast cells of claim 25. 

32. A process for producing insulin-lJJce growth 

15 factor-1 (IGF-1) peptides, comprising culturing the cells 
of claim 18 under conditions whereby said cells express 
and secrete mature IGF-1 peptide into the culture medium. 

33. A process of claim 32, wherein said 
methylotrophic yeast is a strain of pjcfrifl pastoris. 

20 34. The process of claim 32, wherein said cells are 

cultured in a medium containing methanol as a carbon 
source. 

35. The process of claim 32, wherein said cells 
have the Mut* phenotype. 
25 36. The process of claim 32, wherein said cells 

have the Mut" phenotype. 

37. The process of claim 32, wherein the cells are 
proteolytically intact and the pH of the medium is 
between about 2 and about 4. 
30 38. The process of claim 37, wherein the pH is 

less than about 3. 

39. The process of claim 32, wherein said cells 
are protease deficient and the pH of the medium is 
between about 2 and 5. 
35 40. A process for producing insulin-like growth 
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f actor- 1 (IGF-1) peptides, comprising culturing the cells 
of claim 21 under conditions whereby said cells express 
and secrete mature IGF-1 peptide into the culture medium. 

41. The process of claim 40, wherein said 

5 methylotrophic yeast is a strain of Pichia pastor is. 

42. The process of claim 40, wherein said cells are 
grown in a medium containing methanol as a carbon source. 

43. The process of claim 40, wherein said cells 
have the Mut* phenotype. 

10 44. The process of claim 40, wherein said cells 

have the Mut* phenotype. 

45. The process of claim 40, wherein the cells are 

proteolytically intact and the pH of the medium is 

between about 2 and about 4. 
15 46. The process of claim 40, wherein the pH is 

less than about 3. 

47* The process of claim 40, wherein said cells 

are protease deficient and the pH of the medium is 

between about 2 and 5. 
20 48. The strain of claim 20, wherein said strain is 

protease deficient and wherein said def icieny does not 

affect at least one protease that recognizes said 

processing site* 

49. The strain of claim 48, wherein said deficiency 
25 results in an absence or reduction in the concentration 

of one or more proteinases selected from the group 
consisting of proteinase A, carboxypeptidase Y, and 
proteinase B. 

50. The strain of claim 49, which is M+IMB206S1, 
30 M+IGF816S1, M+IGF816S4, or C+IGF816S1. 
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FIRST AMINO ACID OF 
MATURE IGF-1 



MGPETLCGAELVOALQ 
AAGCTTACCTGCCATGGGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCA 
Hindi IIBspMINcol Sad PstI 

TTCGAATGGACGGTACCCTGGCCTCTGCGAGACGCCCCGACTCGAGCACCTACGAGACGT 
10 20 30 40 50 60 

FVCGDRG FYFNKPTGYGSSS 
GTTCGTGTGTGGAGACAGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAG 

CAAGCACACACCTCTGTCCCCGAAAATAAAGTTGTTCGGGTGTCCCATACCGAGGTCGTC 
70 80 90 100 110 120 

rrapqtgivdeccfrscdlr 
tcgacgggcgcctcagacaggcatcgtggatgagtgctgcttccggagctgtgatctaag 

Sail BspMII 

agctgcccgcggagtctgtccgtagcacctactcacgacgaaggcctcgacactagattc 

130 140 150 160 170 1 80 

RLEMYCAPLKPAKSA** 

gaggctcgagatgtattgcgcacccctcaagcctgccaagtcagcttgataaggatccga 

Aval BamHI 

ctccgagctctacataacgcgtggggagttcggacggttcagtcgaactattcctaggct 

190 200 210 220 230 240 

ATTC 
EcoRI 

FIG. 1 
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